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This  dissertation  considers  asynchronous  direct-sequence  code-division  multiple- 
access  communication  systems  operating  over  additive  white  Gaussian  noise  as  well  as 
fading  channels.  Particular  attention  is  given  to  the  near-far  problem,  which  occurs 
when  the  received  signal  amplitudes  from  the  concurrent  users  are  very  dissimilar. 

A class  of  receivers,  called  linear  single-bit  single-user  receivers,  are  defined,  and 
their  properties  are  analyzed.  The  minimum  mean-squared  error  (MMSE)  receiver  is 
a particularly  attractive  receiver  in  this  class.  The  MMSE  receiver  is  investigated  in 
great  detail  and  is  shown  to  be  near-far  resistant  and  optimum  (in  the  error  probabil- 
ity sense)  when  the  interference  is  negligible.  Optimum  linear  complexity  reduction 
through  dimension  reduction  is  proposed  and  studied.  Unfortunately,  the  optimum 
scheme  requires  side-information  that  is  not  readily  available.  Several  practical,  but 
suboptimum,  schemes  are  therefore  proposed  and  evaluated. 

The  parameter  estimation  problem — the  problem  of  estimating  each  user’s  prop- 
agation delay,  received  amplitude,  and  phase — is  introduced,  and  the  Cramer-Rao 
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bound  (CRB)  is  derived.  The  CRB,  which  bounds  the  accuracy  with  which  the  pa- 
rameters may  be  estimated,  serves  as  the  optimality  criterion.  Optimum  linear  com- 
plexity reduction  through  dimension  reduction  is  developed.  Significant  complexity 
reduction  may  be  obtained;  however,  the  optimum  scheme  again  requires  knowledge 
of  unavailable  side-information.  The  optimum  maximum-likelihood  (ML)  estimator 
is  presented,  and  several  suboptimum  schemes — with  significantly  lower  complexity 
than  the  ML  estimator — are  proposed  and  studied.  The  proposed  estimators  are  ex- 
perimentally shown  to  be  robust  to  the  near-far  problem,  which,  in  turn,  is  shown  to 
be  fatal  to  the  previously  known  methods. 

The  performance  degradation  of  the  decorrelating  detector  caused  by  propagation 
delay  estimation  errors  is  investigated  in  theory  and  quantified  by  simulations.  It 
is  shown  that  very  accurate  propagation  delay  estimation  is  needed  to  retain  good 
performance. 
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CHAPTER  1 
INTRODUCTION 

1.1  Notation  and  Conventions 

The  Kronecker  delta  function  is  defined  as 

( 1 if  p = 0 

Mp)  = S 

1 0 otherwise 

Matrices  and  vectors  are  typeset  in  bold  face,  for  instance,  A or  v.  The  (i,j) th 
element  of  a matrix  A is  denoted  by  A With  diag(ai,  a2, . . . , aK)  we  mean  the 
K x K diagonal  matrix  whose  kth  diagonal  element  is  a*,,  i.e., 

(diag(a1,a2, . . . , aK))(k,  k))  = ak 

On  the  other  hand,  diag(A)  is  a diagonal  matrix  with  the  same  dimension  and  diag- 
onal as  A, 

(diag(A))(z,j)  = A (i,i)SK(i-j) 

The  transpose,  conjugate  transpose,  and  2-norm  of  a vector  x are  denoted  by  xT,  x* 
and  ||x||  = \/x*x,  respectively.  We  will  frequently  use  conjugate  transpose  notation, 
A*,  to  indicate  transpose  of  a real  quantity  A.  Conjugation  of  a scalar  a may  be 
denoted  by  either  a or  «*. 

Two  subspaces  associated  with  a matrix  A e (Dmxn  are  the  range  ( column  space) 

range(A)  = {Ax  : x € Cn} 

and  the  null  space 

null(A)  = {x  G C : Ax  = 0} 
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The  Fundamental  Theorem  of  Linear  Algebra  says  that  the  row  space  of  a matrix  A, 
i.e.,  range(A*),  is  the  orthogonal  complement  to  the  null  space  of  A, 

range(A*)  = (null(A))1- 

The  correlation  of  two  complex  random  variables  x and  y is  defined  as  E [xy\.  This 
definition  is  in  complete  analogy  with  the  real  case  [31].  However,  some  researchers  in 
the  communication  field  find  it  necessary  to  define  the  correlation  as  E[xy\/2,  when 
x and  y are  complex,  and  as  E[xy],  when  x and  y are  real  [36].  This  author’s  opinion 
is  that  this  convention  introduces  more  confusion  than  convenience  and  should  thus 
be  avoided. 

1.2  Introduction  to  Multiple- Access  Communication  Systems 

A multiple- access  communication  system  is  a system  that  serves  the  communi- 
cation needs  of  a population  of  concurrent  users.  An  example  of  a multiple-access 
system  is  a cellular  telephone  system.  We  will  only  consider  digital  communication  in 
this  dissertation.  As  deduced  from  its  name,  a digital  communication  system  trans- 
mits digital  information,  i.e.,  information  that  can  be  thought  of  as  a stream  of  binary 
symbols  (bits).  A natural  measure  of  the  quality  of  the  communication  link  is  the 
average  probability  of  error,  or  bit  error  rate,  since  the  smaller  the  bit  error  rate,  the 
less  distortion  is  introduced  in  the  transmitted  data. 

Radio-based  multiple-access  systems  are  typically  required  to  operate  inside  a 
certain  frequency  band  and  below  certain  mean  and  peak  powers,  i.e.,  the  transmitted 
signals  must  be  bandlimited  and  restricted  in  peak  and  mean  amplitude.  The  available 
bandwidth  and  allowed  mean  and  peak  powers  can  therefore  be  viewed  as  the  system 
resources.  On  the  other  hand,  we  can  quantify  the  users’  communication  demands 
by  the  needed  data  rate,  or  symbol  rate,  and  the  largest  tolerable  bit  error  rate.  For 
example,  a cellular  telephone  user  might  require  that  information  is  transmitted  at  a 
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data  rate  of  48  x 103  bits  per  second  and  with  a probability  of  error  less  than  10-3. 
A multiple-access  system  should  be  designed  such  that  each  user’s  communication 
demands  are  satisfied  while  using  up  a minimum  of  the  system  resources.  An  obvious 
performance  measure  for  multiple-access  systems  is  the  system  capacity , which  is 
defined  as  the  maximum  number  of  users  that  can  simultaneously  communicate  at  a 
specified  bit  error  rate. 

It  is  common  to  divide  multiple-access  systems  into  two  categories:  time-division 
multiple- access  (TDMA)  systems  and  frequency- division  multiple- access  (FDMA)  sys- 
tems. In  an  FDMA  system,  the  available  bandwidth  is  divided  into  frequency  chan- 
nels, and  each  user  is  allowed  to  transmit  continuously  in  its  assigned  channel.  Con- 
versely, in  a TDMA  system  the  time-axis  is  divided  into  time  slots , and  a user  is 
allowed  to  use  the  entire  system  bandwidth  during  the  assigned  time  slot. 

Hybrids  of  FDMA  and  TDMA  exist,  e.g.,  the  American  digital  cellular  standard 
(ADC),  in  which  the  system  bandwidth  is  divided  into  30  kHz  frequency  channels, 
and  each  frequency  channel  is  divided  into  three  time  slots.  Thus,  three  TDMA  users 
reside  in  each  frequency  channel. 

A third  form  of  multiple-access,  code-division  multiple- access  (CDMA),  has  re- 
cently gained  significant  interest.  In  a CDMA  system,  all  users  are  allowed  to  use 
the  entire  system  bandwidth  at  all  times.  The  separation  between  users  is  neither 
done  in  time  nor  in  frequency,  but  rather  with  codes.  The  roots  of  these  types  of  sys- 
tems can  be  traced  back  to  the  1920s  [43];  however,  until  recently,  CDMA  has  almost 
exclusively  been  used  in  military  applications.  CDMA  systems  can  be  divided  into 
several  subclasses,  e.g.,  frequency  hopping  (FH-CDMA)  systems  and  direct-sequence 
(DS-CDMA)  systems.  Note,  however,  that  other  types  of  CDMA  and  hybrids  of  FH- 
CDMA  and  DS-CDMA  exist,  e.g.,  the  joint  tactical  distribution  system  (JTIDS)  [43]. 

The  manner  in  which  FDMA,  TDMA,  and  CDMA  users  are  allocated  system 
resources  is  illustrated  in  Figure  1.1.  The  total  capacity  of  the  system  is  represented  by 
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a volume  spanned  by  the  three  axis  labeled  time,  frequency,  and  code.  The  shadowed 
region  represents  the  fraction  of  the  system  resources  that  is  allocated  to  a particular 
user. 


TDMA  FDMA  CDMA 


Figure  1.1:  TDMA,  FDMA  and  CDMA  system  resources  allocation. 

In  FH-CDMA,  the  transmitted  signal  is  a narrowband  signal  whose  carrier  fre- 
quency is  changed  (hopped)  many  times  during  the  transmission.  If  the  hopping  rate 
is  greater  than  the  symbol  rate,  then  this  is  called  fast  frequency  hopping.  Conversely, 
if  the  hopping  rate  is  less  than  the  symbol  rate,  then  this  is  called  slow  frequency 
hopping.  The  hopping  pattern  is  defined  by  a user-unique  code,  and  the  receiver 
(which  is  granted  knowledge  of  the  code)  hops  synchronously  with  the  transmitter. 
An  unauthorized  receiver,  however,  will  have  difficulty  tracking  the  hops  (especially 
when  fast  hopping  is  used).  The  hopping  makes  the  system  more  secure  and  robust 
against  intentional  or  unintentional  jamming.  Every  now  and  then,  two  or  several 
users  are  going  to  hop  simultaneously  to  the  same  frequency.  This  will  garble  the  re- 
ceived signal  and  potentially  destroy  reception  until  the  next  hop.  However,  reliable 
communication  can  still  be  ensured  if  this  situation  does  not  occur  too  frequently  and 
error  correcting  coding  is  used. 

In  DS-CDMA,  the  data  signal — which  is  a rectangular  waveform  whose  polarity 
is  equal  to  the  transmitted  bit — is  multiplied  with  a rectangular  code  waveform  that 


5 


changes  polarity  at  a rate  much  higher  than  the  symbol  rate.  The  multiplication  of 
the  data  signal  by  the  code  waveform  is  called  spreading.  The  standard  receiver  for 
DS-CDMA  negates  the  spreading  by  multiplying  the  received  signal  with  a copy  of 
the  desired  user’s  code  waveform.  Consequently,  this  operation  is  called  despreading. 
However,  even  after  despreading,  the  signal  will  contain  interference  from  the  other 
concurrent  users.  The  amount  of  interference  is  proportional  to  the  crosscorrelations 
between  the  desired  user’s  code  waveform  and  the  interfering  users’  code  waveforms. 
Thus,  reliable  communication — when  using  the  standard  receiver — is  possible  only  if 
the  code  waveforms  have  sufficiently  low  crosscorrelations. 

In  both  FH-CDMA  and  DS-CDMA,  the  transmitted  signal  will  occupy  a band- 
width that  is  much  larger  than  the  bandwidth  of  the  original  data  waveform.  CDMA 
systems  are  therefore  sometimes  referred  to  as  spread  spectrum  systems. 

TDMA,  FDMA,  and  CDMA  have  the  same  capacity  if  we  model  the  channel  as  a 
synchronous  additive  white  Gaussian  noise  (AWGN)  channel  [5].  However,  this  is  a 
very  simplistic  and  ideal  model.  For  instance,  a radio  channel  can  seldom  be  modeled 
accurately  as  an  AWGN  channel.  The  propagation  of  a radio  signal  will  often  take 
several  different  paths  to  the  receiver — so-called  multipath  propagation — which  causes 
distortion  to  the  received  signal.  A channel  that  distorts  the  transmitted  signal  is 
called  a fading  channel.  A very  general  method  to  characterize  a channel  is  to  model 
the  channel  as  a linear  filter  with  an  unknown  time- varying  impulse  response  [4], 
The  frequency  response  of  the  channel  (i.e.,  the  Fourier  transform  of  the  impulse 
response)  typically  has  several  nulls,  or  fade  dips.  We  know,  from  elementary  linear 
system  theory,  that  the  received  signal  is  equal  to  the  convolution  of  the  transmitted 
signal  and  channel  impulse  response,  and  furthermore  that  the  power  spectral  density 
(PSD)  of  the  received  signal  is  equal  to  the  product  of  the  transmitted  signal  PSD 
and  the  channel  frequency  response. 

In  FDMA  systems,  the  transmitted  signal — which  is  narrowband — can  be  com- 
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pletely  wiped  out  by  a fade  dip,  and  communication  is  thereby  made  impossible. 
On  the  other  hand,  TDMA  signals  are  more  broadband  than  FDMA  signals  and  are 
therefore  not  as  vulnerable  to  fade  dips.  However,  if  the  length  of  the  impulse  re- 
sponse is  comparable  to  the  symbol  duration,  the  signal  from  a TDMA  user  will  be 
noticeably  smeared  out  in  time,  and  parts  of  the  signal  will  leak  into  the  next  user’s 
time  slot.  The  users  will  therefore  interfere  with  each  other.  This  problem  can  be 
alleviated  by  appending  each  time  slot  with  a guard  interval — a period  of  time  during 
which  no  user  is  allowed  to  transmit.  However,  the  use  of  guard  intervals  will  decrease 
capacity,  since  no  information  can  be  exchanged  during  the  guard  interval. 

Since  the  fading  is  time-variant,  the  location  and  depth  of  the  fade  dips  will 
change  in  time.  To  combat  fading,  the  system  must  use  error  correction  coding  or 
diversity  techniques.  With  diversity  we  mean  that  several  copies  of  a user’s  signal 
is  transmitted  over  independently  fading  channels.  We  can  then  greatly  reduce  the 
bit  error  rate  by  combining  the  received  signals  from  the  channels.  Diversity  can  be 
achieved  in  many  ways.  Firstly,  we  can  transmit  copies  of  the  signal  spaced  in  time 
over  the  same  frequency  channel  ( time  diversity ).  If  the  spacing  in  time  is  greater 
than  the  time  it  takes  for  the  channel  impulse  response  to  change  significantly,  we 
can  assume  that  the  signal  copies  fade  independently.  Secondly,  we  can  use  several 
antennas  that  are  located  sufficiently  far  away  from  each  other  to  ensure  independent 
propagation  paths  ( spatial  diversity).  Thirdly,  we  can  transmit  the  signal  over  several 
frequency  channels  that  are  spaced  sufficiently  in  frequency  ( frequency  diversity). 

It  can  be  argued  that  CDMA  is  more  robust  against  fading  than  FDMA  or 
TDMA  [54,  16,  21,  36].  Heuristically,  this  can  be  done  as  follows.  Since  DS-CDMA 
signals  are  much  more  broadband  than  FDMA  signals,  fade  dips  cannot  completely 
wipe  out  a DS-CDMA  signal.  Moreover,  DS-CDMA  is  less  sensitive  to  a long  channel 
impulse  response  than  TDMA,  since  the  symbol  rate  is  much  slower  in  the  DS-CDMA 
case.  It  can  be  seen  that  FH-CDMA  has  an  inherent  frequency  diversity,  since  it  hops 
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between  frequency  channels.  A fade  dip  will  therefore  only  affect  the  signal  until  the 
next  hop.  Furthermore,  when  a user  has  hopped  from  a frequency  channel,  the  next 
time  slot  in  that  channel  is  not  always  used.  Thus,  the  interference  between  users  due 
to  a long  channel  impulse  response  is  less  in  an  FH-CDMA  system  than  in  a TDMA 
system. 

Another  advantage  of  CDMA  systems  over  TDMA  and  FDMA  systems  is  the 
concept  of  graceful  degradation.  In  FDMA  and  TDMA  systems,  the  maximum  number 
of  users  is  fixed  by  the  number  of  available  frequency  channels  (FDMA)  or  time  slots 
(TDMA).  We  simply  cannot  add  a user  when  all  channels  or  time  slots  are  occupied. 
Furthermore,  the  bit  error  rate  of  a particular  FDMA  or  TDMA  user  is  independent 
of  the  number  of  concurrent  users — at  least  in  the  ideal  AWGN  case.  However,  in 
CDMA  there  is  no  rigid  bound  on  the  maximum  number  of  users.  This  is  since 
the  bit  error  rate  is  a function  of  the  number  of  active  users;  by  adding  a user 
the  performance  is  degraded  for  all  active  users,  but  the  system  will  not  collapse. 
Now,  if  the  communication  needs  of  the  users  are  of  bursty  nature — for  instance  in  a 
telephone  situation  when  only  one  (well,  ideally  only  one)  user  talks  at  a time — we 
can  discontinue  transmission  during  the  time  when  communication  is  not  needed. 
Capacity  is  thereby  increased  since  there  are  fewer  active  users  at  any  given  time. 
Discontinuous  transmission  can  be  used  to  increase  the  capacity  of  FDMA  and  TDMA 
systems  as  well,  but  only  with  considerably  more  difficulty. 

Yet  another  argument  can  be  done  for  CDMA  if  we  consider  cellular  systems.  A 
cellular  system  is  a system  in  which  the  geographical  area  to  be  covered  is  divided 
into  cells,  and  the  users  in  each  cell  communicate  with  a central  base  station.  Cellular 
systems  are  motivated  by  two  main  reasons. 

Firstly,  the  distance  between  a user  and  the  base  station  is  shorter  in  a multicell 
system  than  in  a single  cell  system.  The  transmitted  power  from  the  mobile  unit  can 
thus  be  reduced.  Reducing  the  transmitted  power  prolongs  battery  life  and  reduces 
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the  physical  size  of  the  mobile  device. 

Secondly,  and  more  importantly,  is  the  concept  of  frequency  reuse.  By  frequency 
reuse  we  mean  that  some  cells  in  the  system  use  the  same  frequency  band.  Capacity  is 
thereby  increased  since  the  effective  bandwidth  of  the  system  is  increased.  However, 
transmissions  from  frequency  reusing  cells  will  interfere  with  each  other — so-called 
cochannel  interference.  Capacity  is  increased  by  shortening  the  distance  between 
frequency  reusing  cells,  however,  cochannel  interference  is  also  increased.  Thus,  the 
smallest  possible  reuse  distance,  and,  thereby,  the  largest  capacity  gain,  is  dependent 
on  how  robust  the  multiple-access  system  is  toward  interference.  CDMA  is  less  sensi- 
tive to  cochannel  interference  than  FDMA  or  TDMA,  and  CDMA  might  therefore  be 
the  best  choice  for  a cellular  system.  Two  cellular  systems  with  frequency  reuse  are 
depicted  in  Figure  1.2.  The  shadowed  cells  are  using  the  same  frequencies  and  will 
therefore  interfere  with  each  other.  Note  that  the  frequency  reuse  distance  is  shorter 
in  the  left  system  than  in  the  right  system.  Thus,  if  we  can  neglect  the  cochannel 
interference,  the  system  to  the  left  will  have  the  larger  capacity.  However,  the  cochan- 
nel interference  will  also  be  larger  in  the  leftmost  system.  Hence,  the  actual  capacity 
difference  is  dependent  on  how  sensitive  the  multiple-access  system  is  to  cochannel 
interference.  As  a matter  of  fact,  the  rightmost  system  might  even  have  the  larger 
capacity  for  some  multiple-access  methods. 


Figure  1.2:  Cellular  multiple-access  systems  with  frequency  reuse. 


The  question  of  which  multiple-access  system  is  the  best  is  often  subject  to  heated 
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debate.  Each  system  has  its  own  unique  implementation  problems,  and  the  choice  of 
multiple-access  method  is  strongly  dependent  on  application  issues,  such  as  the  chan- 
nel, the  users’  communication  demands,  and  the  applicable  regulations,  standards 
and  laws. 

Nice  summaries  of  the  early  work  on  CDMA  systems  have  been  made  by  Pickholtz, 
Schilling  and  Milstein  [33]  and  by  Scholtz  [42].  A thorough  tutorial  concerning  the 
properties  of  code  sequences  has  been  written  by  Sarwate  and  Pursley  [39].  Text 
books  on  CDMA  include  Dixon  [10],  Simon,  Scholtz,  Levitt  and  Omora  [44],  and 
Ziemer  and  Peterson  [63].  Most  modern  books  on  digital  communications  contain  a 
treatment  on  CDMA,  e.g.,  Proakis  [36],  Lee  and  Messerschmitt  [20],  and  Blahut  [5]. 

The  area  of  CDMA  research  is  very  active,  and  it  is  impractical  to  list  all  related 
work  here.  However,  in  the  chapters  to  come,  we  will  attempt  to  cite  all  papers  that 
are  relevant  to  the  problem  at  hand. 

1.3  Dissertation  Outline 

This  dissertation  is  divided  into  two  parts.  The  first  part,  Chapters  2-5,  assumes 
a AWGN  channel,  and  the  second  part,  Chapters  6-7,  assumes  fading  channels. 

It  may  be  argued  that  the  first  part  is  redundant  since  the  AWGN  channel  is 
a special  case  of  fading  channels.  However,  it  was  decided  to  keep  the  treatment 
separate  for  two  reasons.  Firstly,  the  AWGN  channel  is  much  easier  to  analyze, 
and  there  are  applications  in  which  the  channel  is  accurately  modeled  as  an  AWGN 
channel.  Secondly,  there  is  a much  better  understanding  of  the  AWGN  channel.  For 
instance,  there  is  not  even  a universally  accepted  model  for  a fading  channel. 

Parts  of  the  original  work  presented  in  this  dissertation  have  also  been  published 
at  various  IEEE  conferences:  [47,  52,  49,  51,  48,  50]. 


CHAPTER  2 

SYSTEM  MODEL  FOR  THE  AWGN  CHANNEL 


The  objective  of  this  chapter  is  to  formulate  a convenient  model  for  an  asyn- 
chronous DS-CDMA  communication  system  operating  over  an  AWGN  channel. 

The  original  work  in  this  chapter  is  threefold:  (1)  the  formulation  of  the  discrete- 
time complex  vector  model,  as  depicted  in  Figure  2.2  and  defined  by  (2.17),  (2)  the 
derivation  of  the  eigenvalue  decompositions  (2.29)  and  (2.37),  and  (3)  the  statement 
and  proof  of  Theorem  2.3. 

This  chapter  contains  many  fundamental  definitions  and  algebraic  manipulations, 
but  very  little  discussion  or  new  results.  However,  the  author  hopes  that  the  dryness 
of  this  chapter  will  neither  put  the  readers  to  sleep  nor  deter  them  from  continuing 
to  read  the  more  interesting  chapters  to  come. 

2.1  Continuous-Time  System  Model 

The  system  under  consideration  is  modeled  as  an  asynchronous  K- user  DS-CDMA 
system  operating  over  an  AWGN  channel.  The  modulation  scheme  is  binary  phase- 
shift  keying  (BPSK)  with  bit  duration  T and  chip  duration  Tc  = T/N,  where  N is  an 
integer.  The  code  waveforms  are  assumed  to  be  periodic  with  period  T.  As  a general 
rule,  a subscript  k implies  that  the  subscripted  quantity  is  due  to  the  kth  user.  For 
instance,  a period  of  the  kth  user’s  code  waveform  is  denoted  by  bk(t),  where  bk(t)  = 0 
for  t <£  [0,  T). 

The  baseband  signal,  sk(t),  is  formed  by  pulse  amplitude  modulating  the  data 
stream,  dk(m ) £ { — 1, 1},  with  a period  of  the  code  waveform,  i.e. , 

OO 

Sk{t)  = 5Z  dk(m)bk(t  — mT)  (2.1) 
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The  transmitted  signal  is  formed  by  multiplying  sk(t)  with  the  carrier  y/ 2 Pk  cos  (ujct  + 
0k),  where  Pk  is  the  power  and  6'k  is  the  random  carrier  phase  uniformly  distributed 
in  [0,  27t).  We  assume,  without  loss  of  generality,  that  P\  — l and  Tc  = 1. 

The  received  signal  may  be  written  as 

r(t)  = Re  sk(t  ~ Tk)\Jw~kexp(j(ujct  + 6>fc))|  +n(t)  (2.2) 

where  rk  G [0,  T)  is  the  unknown  propagation  delay  and  6k  = 9'k  — ujcTk.  The  noise 
waveform,  n(t),  is  a white  Gaussian  noise  waveform  with  two-sided  power  spectral 
density  N0/2. 

2.2  Discrete-Time  Complex  Vector  System  Model 

The  receiver  front-end  consists  of  a standard  IQ-mixing  stage  followed  by  an 
integrate-and-dump  section,  as  shown  in  Figure  2.1.  The  integration  time,  Tt,  is 
defined  as  T*  = Tc/Q  where  Q is  an  integer  and  is  referred  to  as  the  oversampling 
factor. 


Figure  2.1:  Receiver  front-end. 

Ignoring  double  frequency  terms,  the  equivalent  complex  received  sequence,  r(l)  = 
r/(0  + 3rQif)i  can  be  expressed  as 

K / — 1 rlTi 

r{l)  = n(l)  + J2\/Pkexp {jdk)—  / sk(t  - rk)  dt 


(2.3) 
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where  n(l)  is  a zero-mean  white  complex  Gaussian  sequence  with  variance 

No  = QNNp 

Ti  Eb>1 

where  Ebti  = T is  the  energy  per  bit  for  the  first  user. 

Let  the  received  vector  during  the  mth  bit  interval,  r(m)  G CQAr,  and  the  noise 
vector,  n(m)  G CQJV,  be  defined  as 

r(m)  = [r{mQN  + QN)  r(mQN  + QN  — 1)  •••  r{mQN  + 1)  ]T  (2.5) 

n(m)  = [n(mQN  + QN)  n(mQN  + QN  — 1)  •••  n(mQN+l)]T  (2.6) 

It  is  easy  to  show  that  the  noise  vector  is  a zero-mean  complex  Gaussian  random 
vector  with  second  moment  matrices 


(2.4) 


E[n(p)n*(g)]  = a2IQN8K(p  - q),  E[n(p)nT(g)]  = 0 (2.7) 


After  some  straightforward  calculations  we  can  formulate  the  contribution  from 
the  kth  user  to  r(m)  as 


rt(m)  = [a2/b-i  a2t 


1 

0 

1  

' z2k-i(my 

o 

1 

. z2k{m ) 

(2.8) 


where 


Z2k-i(m)  = dk(m ) + dk(m  - 1)] 

(2.9) 

Z2k{m)  = ~[dk{m)  - dk(m  - 1)] 

(2.10) 

(Ik  = V^exp(i0fc) 

(2.11) 

From  (2.8)  we  see  that  the  kth  user  contributes  dk(m)/3k a2fe_i,  if  dk(m ) = dk(rn  — 1), 
or  dk(m)pk a2fc,  if  dk(m)  = —dk(m  — 1).  The  vectors  {a2fc_i,a2fc}  are  defined  by  the 
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kth  user’s  propagation  delay  and  code  waveform  as 


a2fc-l  — 


a2  k 


jrD(Pfc  + 1)1)+  fl  ~ 1) 


C k 


Y^(.Pk  + 1)  _i)  + fi  ~ y)  D(pfc>  _i) 


C k 


(2.12) 

(2.13) 


where  rk  = + <5*,  such  that  pk  is  an  integer  and  5k  G [0,  7*),  and  cfc  € IR0Ar  is 

defined  as 


ck  = [ck(QN)  ck(QN  — 1) 
1 rlTi 


Ck{  l)f 


Cfc(0  = Y [ \ 
U J(l- l)T{ 


dt 


The  matrix  D(r,  s)  G {0,  IJQWxQv  is  defined  in  block  form  as 

0 ^-QN-r 


D(r,s)  = 


sL  0 


We  adopt  the  conventions  that  D(0,  s)  = Iqat  and  D (QN,  s)  = sIQN. 
The  expression  for  r(m)  can  be  written  in  a more  compact  form  as 


K 


r(m)  = n(m)  + ^ r k(m)  = A(r)B(7,  0)z(m)  + n(m) 

fc=i 


(2.14) 

(2.15) 


(2.16) 


(2.17) 


where 


A(r)  = [ai  a2  •••  a2K  ] G JRQNx2K  (2.18) 

B(7,  6)  = diag(/31>  ft,  ft,  A, . . . , /?* , /?*)  6 <D2*x2*  (2.19) 

z(m)  = [zi(m)  z2(m)  •••  z2K(m)  ]T  G {-1, 0,  l}2*  (2.20) 


We  observe  that  A(r)  and  B(7,  0)  are  functions  of  the  vectors  r,7,  0 G IR^, 


r = [n 

r2  . . 

iT 

• r*] 

(2.21) 

0 = [*i 

e2  ■ 

••  0k]T 

(2.22) 

7 = [Ti 

72  • 

7k  ]T 

(2.23) 
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where  7*,  = y/Pk-  The  explicit  dependence  of  A and  B on  the  parameter  vectors  are 
dropped  for  notational  convenience.  We  will  assume  that  A and  B have  full  rank.  The 
A matrix  will  have  full  rank  if  and  only  if  {ai,  a2, . . . , a# } are  linearly  independent 
for  all  possible  values  of  r.  This  is  obviously  desirable  for  a DS-CDMA  system  since 
users  otherwise  can  cancel  each  other’s  transmissions.  The  B matrix  will  be  of  full 
rank  if  Pk  > 0 for  all  k,  which  clearly  is  the  case.  The  final  discrete-time  complex 
vector  model  is  depicted  in  Figure  2.2. 


&2k 

Figure  2.2:  Discrete-time  complex  vector  model 


2.3  Properties  of  the  Correlation  Matrix 
The  correlation  matrix  for  r(m)  is 

R = E[r(m)r*(m)]  = AB  E[z(m)z*(ra)]B*A*  + g2\qn  = ASA*  + a2lQN  (2.24) 

where  t*,,  9k,  and  7 are  considered  to  be  unknown  and  deterministic  parameters.  We 
assume  that  the  data  streams  consist  of  equally  likely  independent  bits;  furthermore, 
we  assume  that  the  noise  is  independent  of  the  data  streams.  It  is  then  easy  to  show 
that 

S = B E[z(m)z*(m)]B*  = i diag(Pi,  Pi, ... , PK,  Pk)  £ ]R2Kx2K 


(2.25) 
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Note  that  R is  symmetric  and  that  R is  positive  definite  if  a2  > 0. 

Since  ASA*  is  real,  symmetric,  positive  semidefinite,  and  of  rank  2 K,  there  exists 
an  eigenvalue  decomposition  of  ASA*  such  that 


ASA* 


[E, 


(2.26) 


The  matrices  Es  £ ]RQNx2K,  En  £ jrQNx(Qn~2K)  are  such  that  [Es  E„  ] £ ]^QNxQN 
is  orthogonal.  Furthermore,  A'  = diag(Ai,  A2, . . . , X2k)  £ IR2^*2^  js  a diagonal 
matrix  of  the  2 K nonzero  and  positive  eigenvalues  of  ASA*. 

We  define  the  signal  subspace  to  be  the  subspace  spanned  by  the  columns  of  A, 
denoted  by  range(A),  and  the  noise  subspace  to  be  the  orthogonal  complement  to 
the  signal  subspace.  From  (2.26),  it  follows  that  range(Es)  = range(ASA*),  and  the 
Theorem  2.1  below  asserts  that  range(A)  = range(ASA*).  Thus,  after  proving  the 
theorem,  we  conclude  that  the  signal  subspace  is  equal  to  range (Es). 


Theorem  2.1  Suppose  A <E  IR0^*2*  and  S £ (T2Kx2K , QN  > 2 K,  are  full  rank 
matrices.  Then  range(A)  = range(ASA*). 

Proof:  Since  (range(X))x  = null(X*),  it  is  sufficient  to  show  that  null(A*)  = 
null(ASA*),  to  prove  the  theorem. 

Suppose  u £ null(A*),  then  A*u  = 0 and 


(ASA*)u  = 0 (2.27) 

Therefore,  u £ null(ASA*)  or,  equivalently,  null(A*)  C null(ASA*). 

Suppose  u £ null(ASA*),  then 

0 = ASA*u=  (AS)(A*u)  (2.28) 


Since  A and  S have  full  rank,  it  follows  that  AS  has  full  rank,  and  (2.28)  then  implies 
that  A*u  = 0.  Hence,  u £ null(A*)  or,  equivalently,  null(ASA*)  C null(A*),  and 
the  theorem  follows.  ■ 
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It  is  easy  to  see  that  an  eigenvector  of  ASA*  is  also  an  eigenvector  of  R.  Thus, 
we  can  write  an  eigenvalue  decomposition  for  R as 


R = ASA*  + a2lQN  = [E, 


0 

0 


= [E  s 


E „]• 


(2.29) 


where  As  = diag(Ai  + a2, . . . , X2K  + cr2)  and  An  = a2lQN_2K. 

For  reasons  that  will  be  explained  later,  we  will  now  consider  some  of  the  properties 
of  the  correlation  matrix  for  r v(m)  = V*r (m).  The  matrix  V G (DQArxi  js  such  that 
V*V  = IL,  which  implies  that  L < QN  and  that  V has  full  rank.  Furthermore, 
we  assume  that  the  signal  subspace  is  contained  in  the  column  space  of  V,  i.e., 
range(Es)  C range(V).  The  correlation  matrix  Rv  e IRLxL  for  vv{m)  is  then 


Rv  = E[ry(m)r^(m)]  = V*ASA*V  + a2V*V  = V*ASA*V  + a2IL  (2.30) 


The  following  theorems  reveal  some  interesting  properties  of  the  correlation  matrix. 


Theorem  2.2  Suppose  V e (^QNxl  ig  guch  that  y*y  _ ^ and  range(Es)  C 
range(V)  then 


VV*u  = u 


(2.31) 


for  all  u G range(Es). 

Proof:  Since  V*V  = 1^,,  the  columns  of  V form  an  orthonormal  basis  for  the 
subspace  range(V)  D range(Es).  We  can  therefore  express  u as  a linear  combination 
of  the  columns  of  V, 


L 


u = Z WVJ 

j= 1 


(2.32) 


where  Wj  is  the  jth  column  of  V.  Hence, 

Wu  = VV*  ■£  W = E GjW'Vj  = EaJVei  = Ea,v1=u 

j— 1 3= 1 3= 1 3= 1 


where  ej  is  the  jth  unit  vector. 


(2.33) 
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Theorem  2.3  Suppose  an  eigenvalue  decomposition  of  ASA*  is  given  by  (2.26), 
then  an  eigenvalue  decomposition  of  V*ASA*V,  where  V e <^9NxL  is  such  that 
V*V  = 1L  and  range(Es)  C range(V),  can  be  written  as 


V*ASA*V  = [ V*ES 


0 


E 


n,V] 


(2.34) 


Proof:  In  order  to  prove  the  theorem,  it  is  sufficient  to  show  that  the  columns  of 
V*ES  are  eigenvectors  to  V*ASA*V  with  corresponding  eigenvalues  Aj,  A2, . . . , A2 k- 
Let  Xj  — V*Uj  where  is  the  jth  column  of  Es.  Since  e range(Es)  C 
range(V)  = (nullfV*))-1-  it  follows  that  Xj  ^ 0.  Consider 

V*ASA*Vxj  = V*ASA*VV*Uj  = V*ASA*u,  (2.35) 

since,  according  to  Theorem  2.2,  VV*Uj  = u;.  Now,  since  is  an  eigenvector  of 
ASA*  with  eigenvalue  Xj,  it  follows  that 


V*ASA  *uj  = A jV*Uj  = A jXj 


(2.36) 


Thus,  Xj  is  an  eigenvector  of  V*ASA*V  with  eigenvalue  Xj  for  j = 1,2,...,  2 K. 
Now,  since  rank(ASA*)  = 2 K,  we  know  that  rank(V*ASA*V)  < 2 K and  therefore 
V*ASA*V  will  have  at  least  L — 2 K zero  eigenvalues.  Moreover,  since  we  just 
showed  that  there  are  2 K nonzero  eigenvalues,  V*ASA*V  will  have  exactly  L - 2K 
zero  eigenvalues.  The  theorem  follows  if  we  let  E n<v  be  an  L x (L  — 2K)  matrix  whose 
columns  are  eigenvectors  of  V* ASA* V corresponding  to  the  zero  eigenvalues.  ■ 


We  note  that  an  eigenvector  to  V*ASA*V  with  eigenvalue  Xj  is  also  an  eigenvector 
to  Ry  with  eigenvalue  + a2.  Hence,  we  can  write  the  an  eigenvalue  decomposition 
of  Rv  as 


where  As 


Rv  = [V*Es 


0 


. 0 A ny 

diag(Aj  + a2, . . . , A2 k + o-2)  and  A ny 


[V*E,  En>v]* 

= O'2!  L-2K- 


(2.37) 


CHAPTER  3 

RECEIVERS  FOR  THE  AWGN  CHANNEL 

3.1  Introduction  to  DS-CDMA  Receivers 

The  DS-CDMA  receiver  problem  has  enjoyed  a lot  of  attention  during  the  last 
decade,  and  a tremendous  of  number  papers  have  been  published.  The  following 
introduction  is  not  to  be  treated  as  a comprehensive  review  of  all  research  in  this 
field  but  merely  as  an  indicator  of  major  trends.  The  terms  receiver  and  detector 
will  be  used  interchangeably  to  denote  a device  that  produces  an  estimate  of  the 
transmitted  bit  sequence  by  processing  the  received  signal.  As  mentioned  in  the 
preceding  chapter,  the  optimality  criterion  of  a detector  is  the  probability  of  error 
(or,  equivalently,  bit  error  rate),  i.e.,  the  probability  that  the  estimated  bit  is  not 
equal  to  the  transmitted  bit. 

A summary  of  the  state-of-the-art  in  the  early  1980s  has  been  written  by  Pick- 
holtz,  Shilling  and  Milstein  [33].  In  1986,  Verdu  found  the  optimum  receiver  for 
DS-CDMA  systems  operating  over  AWGN  channels  [57].  Verdu  showed  that  a re- 
markable performance  gain  may  be  achieved  if  the  optimum  receiver  is  used  instead 
of  the  standard  matched  filter  receiver. 

The  front-end  of  the  optimum  receiver  is  a bank  of  K filters.  Each  filter  is  matched 
to  a particular  user’s  code  waveform  and  is  sampled  with  respect  to  that  user’s  prop- 
agation delay.  The  front-end  output  is  a sufficient  statistic  for  detecting  the  data 
sequences  from  all  K users  [57].  This  means  that  the  receiver  front-end  will  not  dis- 
card any  information  about  the  data  sequences  while  processing  the  received  signal  [6]. 
The  maximum-likelihood  estimates  of  the  transmitted  data  sequences  are  obtained 
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by  processing  the  sufficient  statistic  with  a Viterbi  algorithm  [58].  However,  the  op- 
timum receiver  suffers  from  two  problems:  (1)  exponentially  growing  complexity  in 
the  number  of  users  and  (2)  the  need  for  perfect  side-information,  i.e.,  knowledge  of 
the  number  of  users  and  of  each  user’s  code  waveform,  propagation  delay,  received 
phase,  and  amplitude.  The  commonly  accepted  opinion  is  that  the  optimum  receiver 
has  little  practical  value,  mainly  because  of  its  complexity.  This  opinion  motivates 
research  aimed  at  devising  suboptimum  detectors  that  have  two  properties:  (1)  less 
complexity  than  the  optimum  receiver  and  (2)  better  performance  than  the  standard 
receiver. 

Lupas,  a student  of  Verdu’s,  has  proposed  a detector  called  the  decorrelating 
receiver  [23,  24,  22],  The  performance  of  the  decorrelating  receiver  is  almost  always 
superior  to  that  of  the  standard  receiver,  and  complexity  grows  linearly  with  the 
number  of  users.  However,  significant  side-information  is  needed,  e.g.,  knowledge  of 
the  number  of  users  and  of  each  user’s  code  waveform,  propagation  delay,  and  received 
phase.  The  decorrelating  receiver  is  a linear  receiver  in  the  sense  that  the  decision  is 
made  as  the  sign  of  a linear  function  of  the  received  signal.  We  will  further  discuss 
the  Lupas  decorrelating  receiver  in  Chapter  5. 

A nonlinear  method,  called  the  multistage  receiver , has  been  proposed  by  Varanasi 
and  Aazhang  [56].  We  can  view  the  multistage  receiver  as  an  interference  cancellation 
scheme.  To  understand  this  concept,  consider  the  output  of  the  /cth  user’s  matched 
filter.  The  matched  filter  output  will  contain  a term  from  the  desired  user  (i.e.,  the 
kih  user)  and  an  interference  term  due  to  the  other  K - 1 users.  Clearly,  if  we  knew 
this  interference  term,  we  could  simply  subtract  it  from  the  matched  filter  output 
and  form  the  estimate  of  the  kth  user’s  data  signal  as  the  sign  of  the  interference- 
free  filter  output.  This  strategy,  as  a matter  of  fact,  has  optimum  performance. 
However,  in  practice,  the  interference  is  unknown  and  must  therefore  be  estimated. 
Thus,  an  interference  cancellation  receiver  is  defined  as  a receiver  that  subtracts  the 
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received  signal  with  an  estimate  of  the  interference  before  making  the  final  decision. 
In  addition  to  the  multistage  receiver,  many  other  interference  cancellation  schemes 
have  been  proposed,  see  for  instance  [11,  61]  and  the  references  therein. 

DS-CDMA  receivers  can  be  classified  in  many  categories.  As  mentioned  above, 
the  receiver  is  either  linear  or  nonlinear.  We  can  also  divide  DS-CDMA  receivers  into 
single-user  and  multiuser  detectors.  A multiuser  receiver  detects  several  users’  data 
signals  at  the  same  time,  whereas  a single-user  receiver  is  concerned  with  only  one 
particular  user’s  data  signal.  Finally,  we  distinguish  between  single-bit  and  sequence 
detectors.  A sequence  detector  accumulates  the  received  signal  for  several  bit  du- 
rations and  makes  a decision  of  several  bits  at  once;  however,  a single-bit  detector 
makes  its  decisions  on  a bit-by-bit  basis.  According  to  these  definitions,  the  stan- 
dard receiver  is  a linear  single-user  single-bit  detector;  the  optimum  and  multistage 
receivers  are  nonlinear  multiuser  sequence  detectors;  and  the  decorrelator  receiver  is 
a linear  multiuser  sequence  detector. 

The  scope  of  this  chapter  is  limited  to  the  class  of  linear  single-bit  single-user 
receivers.  In  Section  3.2,  this  class  of  receivers  is  defined  in  terms  of  a generic  re- 
ceiver, and  an  expression  for  the  error  probability  is  derived.  The  near-far  problem  is 
introduced,  and  two  performance  measures,  called  asymptotic  efficiency  and  near-far 
resistance,  are  defined  and  formulas  for  computing  these  are  derived.  The  standard, 
decorrelating,  and  minimum-mean  square  error  (MMSE)  receivers,  which  are  special 
cases  of  the  generic  receiver,  are  introduced  in  Sections  3.3,  3.4,  and  3.5,  respectively. 
Even  though  the  decorrelating  receiver  studied  in  this  chapter  is  the  single-user  single- 
bit version  of  the  Lupas  decorrelating  receiver,  the  receivers  are  quite  different  and 
should  not  be  confused  with  each  other. 

The  properties  of  the  MMSE  receiver  are  derived  and  discussed  in  Section  3.5.2. 
The  problem  of  complexity  reduction  of  the  MMSE  receiver  through  dimension  re- 
duction is  presented  in  Section  3.6;  the  optimum  scheme  is  derived  and  several  sub- 
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optimum  methods  are  proposed.  In  Section  3.6.5,  we  present  simulation  results  that 
compare  the  proposed  dimension  reduction  strategies. 

A very  simple  modification  can  be  made  to  the  MMSE  receiver  that  lowers  the 
complexity  and  also  increases  the  performance.  Technically,  this  modification  renders 
the  receiver  nonlinear,  and  the  modified  receiver  therefore  falls  outside  the  scope  of 
this  chapter.  However,  since  many  of  the  results  presented  here  will  apply  for  the 
modified  receiver  as  well,  a discussion  of  the  modified  receiver  is  included  and  can  be 
found  in  Section  3.7. 

Throughout  this  chapter  we  will  assume  that  the  receiver  is  synchronized  with  the 
desired  user  (user  number  1).  We  can  then,  without  loss  of  generality,  set  tx  — 0. 
This  implies  that  a!  = a2  = C!  and  that  the  received  vector  from  the  desired  user  is 
ri(m)  = di{m) (3iC]_.  The  results  in  this  chapter  may  easily  be  extended  to  the  case 
of  imperfect  synchronization.  Furthermore,  the  synchronization  problem  is  treated  in 
Chapter  4. 

The  original  work  of  this  chapter  is  (1)  the  derivation  of  the  effective  SNR  for  the 
MMSE  receiver  (3.44),  (2)  the  proofs  of  the  MMSE  receiver  asymptotic  behavior  found 
in  Theorems  3.1,  3.2  and  3.3,  (3)  the  formulation  of  previously  proposed  complexity 
reduction  schemes  as  special  cases  of  linear  dimension  reduction,  (4)  the  formulation 
and  proof  of  the  optimum  dimension  reduction  transform  in  Theorem  3.5,  and  (5)  the 
formulation  of  the  gamma,  modified  gamma,  SDR,  and  EGID  dimension  reduction 
schemes. 


3.2  Generic  Linear  Receiver 

In  this  chapter,  we  will  limit  the  discussion  to  linear  single-bit  single-user  re- 
ceivers. A linear  single-bit  single-user  receiver  forms  the  soft  decision — the  input  to 
the  threshold  device — as  the  real  part  of  the  inner  product  of  the  received  vector  and 
a receiver  vector  w e <CQN , as  shown  in  Figure  3.1.  The  hard  decision,  di(m),  is 
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then  formed  as  the  sign  of  the  soft  decision.  Receivers  of  this  class  does  attempt  to 
make  neither  a sequence  nor  a multiuser  detection  and  will  therefore  be  suboptimum 
in  nature  [22]. 


Figure  3.1:  Generic  receiver  structure. 

3.2.1  Probability  of  Error 
The  filter  output , yi(m),  is 

yi(m)  = w*r(m)  - w*ABz(m)  + w*n(m)  (3.1) 

Now  recall  from  (2.9),  (2.10),  and  (2.20)  that  z(m)  is  a function  of  the  data  vector 
d(m)  E { — 1, 1}2K  defined  as 

d(m)  = [di(m  - 1)  •••  dK(m  - 1)  dj(m)  •••  dK(m)]T  (3.2) 

Conditioned  on  d(m),  or,  equivalently,  conditioned  on  z(m),  the  filter  output  will  be 
a complex  Gaussian  random  variable  with  mean 

E[yi(m)  | z(m)]  = w‘ABz(m)  (3.3) 

and  second  moments 

E[|?/i(m)  — w*ABz(m)|2  | z(m)]  = w*  E[n(m)n*(m)]w  = ||w||2a2  (3.4) 

E [(yi(m)  — w’ABz(m))2  | z(m)]  = w*  E[n(m)nr(m)]w  = 0 (3.5) 

Thus,  conditioned  on  z(m),  the  soft  decision,  ui(m)  = Re{yi(m)},  will  be  a real 
Gaussian  random  variable  with  mean 


E[ui(m)  | z(m)]  = Re{w*ABz(m)} 


(3.6) 
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and  variance 


(3.7) 


We  assume  that  the  data  sequences  are  statistically  independent  of  each  other  and 
of  the  noise  sequence.  We  also  assume  that  all  22R  possible  realizations  of  d(m)  are 
equally  likely.  This  implies  that  we  can  write  the  average  error  probability  as 

Pe  = Pr[sgn[ui(m)]  ^ dffm)}  = Pr[ui(m)  < 0 | dffm)  = 1] 

Yl  Pr[iq(m)  < 0 | d(m),di(m)  = 1]  Pr[d(ra),  dffm)  - 1] 

d(m)e{-l,l}2if 
d\  (m)= 1 


The  error  probability  can  be  estimated  using  the  so-called  Gaussian  approxima- 
tion. In  essence,  the  Gaussian  approximation  means  that  the  soft  decision  is  modeled 
as  a Gaussian  random  variable  conditioned  on  dx  (m).  We  denote  the  conditional 
mean  and  variance  of  the  decision  variable  by  mu  and  a2,  respectively, 


We  see  that  m2Jo2u  plays  the  same  role  as  the  signal-to-noise  ratio  (SNR)  in  a single- 
user  system,  and  we  will  therefore  name  this  quantity  the  effective  SNR.  One  should 


(3.8) 


where  the  Q-function  is  defined  as 


(3.9) 


mu  = E[ui(m)  | dffm)  = 1]  = - E[ui(m)  | dffm)  = 1] 
a2  = var[ux(m)  | dffm)  = 1]  = var[ui(m)  | dffm)  = — 1] 


If  uffm)  is  Gaussian  distributed  conditioned  on  dffm),  it  is  straightforward  to  show 
that  the  error  probability  is 
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keep  in  mind  that  the  quality  of  the  estimate  (3.10)  is  dependent  on  how  close  the 
actual  conditional  distribution  of  iti(m)  is  to  a Gaussian  distribution. 

To  justify  the  Gaussian  approximation,  consider  the  soft  decision.  Recall,  since 
T\  = 0,  that  ai  = a2  = c4  and  Z\(m)  + 22(m)  — d\(m).  The  soft  decision  may  be 
written  as 


U\(m)  = Re{yi(m)} 


where 


= Re{di(m)exp(;'^i)w*Ci}  + Re{w*A,B'z,(m)}  + Re{w*n(m)}  (3.11) 

" ' v v ' ' v ' 

signal  part  multiple  access  noise  part 

interference 


A'  = [ a3  a4  • • • a2K-i  a2^  ] G IRgArx2(*  1}  (3.12) 

B'  = diag(/?2, 02,...,  fa,  fa)  e <|*<*-i>x2(Jr-i)  (3.13) 

z'(m)  = [z3(m)  z4(m)  •••  z2K-i(m)  z2K(m)  ]T  G {-1, 0,  l}2(Ar-1}  (3.14) 

The  first  term  in  (3.11)  is  the  desired  signal,  the  second  term  is  the  multiple  access 
interference  (MAI),  and  the  third  term  is  an  AWGN  term.  We  see  that  the  MAI  is 
a sum  of  random  variables 

2 K 

Re{w*A,B'z'(m)}  = £(Re{w*A'B  '})(j)Zj(m)  (3.15) 

1=3 

We  are  therefore  tempted  to  invoke  the  central  limit  theorem  that  allows  us  to  consider 
the  MAI  as  a Gaussian  random  variable  [31].  Furthermore,  the  MAI  is  independent 
of  the  noise  and  di(m),  since  terms  in  the  sum  in  (3.15)  are  independent  of  the  noise 
and  of  di{m). 

Thus,  if  the  central  limit  theorem  is  applicable  (see  Papoulis  for  a discussion  [31]), 
we  can  model  the  MAI  as  Gaussian  random  variable.  To  compute  the  error  proba- 
bility, we  need  only  find  the  variance  and  mean  of  the  MAI.  The  mean  of  the  MAI 
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is 

E[Re{w*A'B'z'(m)}]  = Re{w*A'B'}  E[z'(m)]  = 0 (3.16) 

and  the  conditional  mean  of  ux(m)  is  then  simply  the  signal  part  of  (3.11), 


mu  = Re{di(m)  exp(j0!)w*Ci} 


(3.17) 


The  variance  of  the  MAI  is 


g2uki  = E[Re2{w*A'B'z'(m)}] 

= Re{w*A'B'}  E[z'(m)z,r(m)]  ReT{w*A'B'} 

= ^l|Re{w*A,B'}||2  (3.18) 

Since  the  MAI  and  noise  are  independent  of  each  other  and  of  dx(m),  the  conditional 
variance  of  ux  ( m ) is  the  sum  of  the  noise  and  MAI  variances 

2 2 . IM|V  (n  ^ 

au  ~ °mai  H 2 (3.19) 


and  consequently  we  can  approximate  the  error  probability  by 


T*e,GA  = Q 


( Re{exp(jfli)w*Cy,i}\ 
\ \Z^mai  A ||w||2cr2/2  ) 


(3.20) 


Note,  however,  that  this  equation  is  only  valid  if  the  central  limit  theorem  is  appli- 
cable. 


3.2.2  The  Near-Far  Problem 


Consider  the  MAI  term  found  in  (3.11)  in  more  detail, 

K 

w*A'B'z'(m)  = Eyh  exp(j0fc)[z2fc-i(m)w*a2fc--i  + z2k(m)w*a2k}  (3.21) 

k=2 

The  MAI  is  seen  to  be  linear  in  y/Pk.  It  is  therefore  impossible  to  detect  d\(m) 
reliably  if  Pk  becomes  large  and  w is  not  orthogonal  to  a.2k-\  and  a2fc.  This  is  the 
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so-called  near-far  problem  and  is  a serious  threat  to  the  performance  of  a DS-CDMA 
system. 

The  traditional  solution  to  the  near-far  problem  is  to  employ  power  control.  The 
power  control  algorithm  tries  to  keep  all  received  powers  at  the  same  level  by  adjusting 
the  transmitted  powers.  However,  in  practice  this  is  a difficult  task,  and  performance 
is  severely  degraded  if  the  power  control  is  imperfect  [8]. 

The  performance  of  a receiver  is  normally  measured  by  the  bit  error  rate.  However, 
if  we  are  mostly  concerned  with  the  near-far  problem,  a convenient  performance 
measure  is  the  asymptotic  efficiency  [22],  The  asymptotic  efficiency  for  the  desired 
user  is  defined  as 


where  Ebik  = PkT  is  the  energy  per  bit  for  the  kth  user,  and  Pe(N0/2)  is  the  probability 
of  bit  error  given  by  (3.8).  Recall  that  the  error  probability  is  a function  of  a2,  which 
in  turn  is  a function  of  N0/ 2;  see  (2.4).  As  seen  from  (3.22),  the  asymptotic  efficiency 
is  a number  between  0 and  1.  The  Q-function  in  the  denominator  represents  the  error 
probability  of  a single-user  system  with  an  SNR  of  r2EbA/N0.  Thus,  the  asymptotic 
efficiency  is  a measure  of  how  much  extra  bit  energy  is  needed  by  the  multiuser 
system  to  achieve  the  same  bit  error  probability  as  a single-user  system,  in  the  limit 
as  the  noise  vanishes.  A nonzero  asymptotic  efficiency  implies  that  Pe(N0/2)  decays 
exponentially  as  N0/2  — >•  0,  since  Q(x ) decays  exponentially  for  large  x.  On  the  other 
hand,  for  a linear  receiver,  a zero  asymptotic  efficiency  implies  an  error  floor,  i.e.,  as 
the  SNR  goes  to  infinity,  the  receiver  will  still  have  a nonzero  error  probability  [22]. 

The  asymptotic  efficiency  is  in  general  a function  of  the  received  amplitudes  of  the 
interference  or,  equivalently,  a function  of  the  interfering  users’  received  bit  energies, 
Eb,k  for  k = 2,3,  The  near-far  resistance , fji,  is  defined  as  the  worst  case 


(3.22) 
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asymptotic  efficiency  over  all  possible  received  energies1 

Vi  = jnf  Vi  (3.23) 

Eb,k 

If  the  near-far  resistance  is  zero,  the  receiver  will  then  have  an  irreducible  error  floor 
as  N0/2  — > 0 in  the  worst  case  interference  scenario.  On  the  other  hand,  a nonzero 
near-far  resistance  guarantees  that  the  receiver  can  reliably  detect  the  desired  user, 
regardless  of  the  interference  situation,  if  2Eb,i/N0  is  large  enough. 

In  essence,  the  asymptotic  efficiency  and  near-far  resistance  are  measures  on  the 
robustness  of  the  receiver  to  the  near-far  problem. 

3.3  Standard  Receiver 

The  standard  receiver  is  simply  a filter  matched  to  the  desired  user’s  code  wave- 
form. In  our  setting  this  means  that  w = ws  = exp(j0i)ci.  The  soft  decision  for  the 
standard  receiver  is 

ui(m)  = Re  jdi(m)||c1||2  + ^ exp(-j6,1)c*rA;(m)  + exp(-j6>i)cin(m)|  (3.24) 
l k= 2 J 

The  standard  receiver  is  only  optimum,  i.e.,  has  the  lowest  probability  of  bit  error 
among  all  linear  single-bit  single-user  receivers,  if  the  MAI  term  in  (3.24)  is  zero. 
The  MAI  is  zero  if  K — 1 or  if  Ci  is  orthogonal  to  rfc(m)  for  k = 2,3, ...  ,K.  We 
dismiss  the  K — 1 case  and  note  that  the  second  condition  is  equivalent  to  that  cx 
is  orthogonal  to  {a3,  a4, . . . , a 2k}-  The  orthogonality  condition  cannot  in  general  be 
satisfied  in  an  asynchronous  system,  since  the  interfering  users’  a- vectors  are  functions 
of  the  propagation  delays,  see  (2.12)  and  (2.13). 

It  is  easy  to  show  that  the  standard  receiver  has  zero  near-far  resistance  by  letting 
Pk  — > oo  for  k = 2, 3, . . . , K [22],  The  standard  receiver  is  therefore  only  useful  in  a 
low  MAI  scenario. 

1Note  that  the  bar  over  fji  does  not  indicate  complex  conjugation.  This  notation  for  near-far 

resistance  is  adopted  for  consistency  with  other  work  in  the  CDMA  field. 
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3.4  Single-Bit,  Single-User  Decorrelating  Receiver 

When  the  interference  becomes  dominant  (i.e.,  when  the  noise  disappears,  a —>  0 
or  as  Pk  — > oo  for  k = 2,  3, . . . , K),  the  optimum  receiver  is  a decorrelating  receiver. 
The  receiver  vector  for  a decorrelating  receiver,  wd,  is  chosen  such  that  it  is  orthogonal 
to  the  interfering  a- vectors  while  still  having  a nonzero  inner  product  with  Ci.  The 
interference  term  is  thereby  eliminated  and  the  soft  decision  becomes 


ui(m)  = Re{w^ABz(m)  + w^n(m)}  = Re{d1(m)w^c1  exp(j0i)  + w^n(m)}  (3.25) 


From  the  requirements  on  wd,  we  conclude  that  the  decorrelating  receiver  is  guaran- 
teed to  exist  if  and  only  if  Ci  span{a3,  a4, . . . , a 2k}  = range(A'). 

The  error  probability  follows  from  (3.8)  as 


P , = n(  Re(wrfci  expO^i)} 

M W V \/||wd||2<72/2 


(3.26) 

Performance  is  therefore  maximized  if  we  choose  wd  as  the  vector  that  maximizes  the 
effective  SNR, 


2Re2{wdCi  exp(jflx)} 

<J2||wd||2 

Thus,  the  formal  definition  of  the  decorrelating  receiver  vector  is 

w d = exp(j6>i)  arg  max  ^ 

w£(range(A'))J-  ||w|p 

and  it  is  easy  to  show  that 


(3.27) 


(3.28) 


w d = C exp(j0i)u 


(3.29) 


where  C is  an  arbitrary  nonzero  real  constant  and  u is  the  orthogonal  projection  of 
Ci  onto  (range(A'))-1-. 

Note  that  the  decorrelating  receiver  as  defined  here  is  very  different  from  the 
Lupas  decorrelating  receiver  [22].  For  instance,  the  Lupas  decorrelating  receiver  is  a 
multiuser  sequence  detector,  i.e.,  it  detects  all  K users’  data  sequences  for  all  times 
in  a single  decision.  This  receiver  is  described  in  more  detail  in  Chapter  5. 
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3.5  Minimum  Mean-Squared  Error  Receiver 

In  the  above  we  mentioned  that  the  standard  receiver  is  optimum  if  the  inter- 
ference is  negligible,  and  the  decorrelating  receiver  is  optimum  if  the  interference  is 
dominant.  Both  of  these  situations  are  rather  extreme  in  the  sense  that  a practical 
system  would  probably  be  operating  in  a situation  where  neither  noise  nor  interference 
may  be  ignored.  The  optimum  choice  of  the  receiver  vector  is  therefore  dependent 
on  both  the  SNR  and  the  interference  level.  An  adaptive  receiver  that  can  alter  its 
receiver  vector  depending  on  the  current  scenario  is  therefore  desirable.  The  MMSE 
receiver  has  this  capability,  and  we  will  show  below  that  the  MMSE  receiver  degen- 
erates to  the  standard  receiver  and  the  decorrelating  receiver  as  the  MAI  and  noise 
disappear,  respectively.  It  is  hard  to  trace  the  exact  origin  of  the  MMSE  receiver. 
Many  researchers  have  mentioned  the  receiver;  however,  no  one  claims  to  be  the 
inventor  [27,  25,  1,  32,  61]. 

A block  diagram  of  the  MMSE  receiver  is  shown  in  Figure  3.2.  The  adaptive 
algorithm  adjusts  the  receiver  vector  such  that  the  mean  square  error  (MSE),  J = 
E[|e(m)|2],  is  minimized.  The  error  signal,  e(m),  is  formed  as  the  difference  between 
the  filter  output  and  the  desired  user’s  bit.  The  receiver  will  therefore  require  that 
a known  data  sequence,  or  training  sequence,  is  transmitted  before  any  actual  data 
can  be  transmitted.  Once  the  mean  square  error  is  at  an  acceptable  level,  data 
transmission  begins  and  the  error  signal  is  formed  as  the  difference  between  the  filter 
output  and  the  hard  decision.  These  modes  are  referred  to  as  training  and  decision 
directed  modes,  respectively. 

It  is  well-known,  see  Haykin  [15],  that  the  MSE  can  be  written  as 

J(w)  = E[|e(m)|2]  = E [d\(jn)\  - p*w  - w*p  + p*Rp  (3.30) 


where  R is  the  correlation  matrix  defined  in  (2.24),  and  p is  the  crosscorrelation 
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Figure  3.2:  Block  diagram  of  the  minimum  mean  square  error  receiver. 

vector 

p = E[d1(m)r(m)]  = cx  exp(j6>i)  (3.31) 

Differentiating  (3.30)  with  respect  to  w and  setting  the  result  to  zero  yields  the 
Wiener-Hopf  equations 

<9J(w) 

= 2(Rw  ~ P)  = 0 (3.32) 

If  R is  invertible,  the  minimizer  of  J(w)  and  unique  solution  to  (3.32)  is  the  vector 

w0  - R-1p  (3.33) 

and  the  resulting  mean-squared  error  is 

Jmin  = </(wQ)  = 1 - p*R_1p  (3.34) 

3.5.1  Gaussian  Approximation  Error  Probability  for  the  MMSE  Receiver 

The  general  formula  for  the  error  probability  (3.8)  is,  of  course,  valid  for  the 
MMSE  receiver.  However,  we  can  also  use  the  Gaussian  approximation  (3.20)  to 
estimate  the  error  probability,  and,  as  indicated  in  Section  3.7.3,  this  approximation 
seems  to  be  accurate.  The  effective  SNR  is  related  to  the  mean-squared  error  of  the 
MMSE  receiver  as  follows. 
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We  define  the  real  mean-squared,  error  as 

Jr  = E[Re2{e(m)}]  = E[(di(ra)  - ui(m))2]  (3.35) 

It  is  easy  to  show  that  E[Re2{e(m)}]  = E[Re2{e(m)}  | Hence,  the  real  mean 

squared  error  is  related  to  the  effective  SNR  as 

Jr  = 1 - 2 E[ui(m)  | di (m)]  + E [u\(m)  \ di(m)]  = 1 - 2mu  + m2  + a2u  (3.36) 


and,  thus,  the  effective  SNR  is 


K _ mu 

JT-  { 1 - mu)2 

We  now  proceed  to  find  expressions  for  Jr  and  mu  for  the  MMSE  receiver. 
Jr  = E[Re2{e(m)}]  = ^ E[e2(m)  + e2(m)  + 2|e(m)|2] 

We  known  from  (3.34)  that 


(3.37) 


(3.38) 


E[|e(m)|2]  = 1 - p*R  >p 


(3.39) 


Consider  E[e2(m)], 

E[e2(m)]  = E [(di(m)  - w*r(m))2] 

= 1-2  E[di(m)w*r(m)]  + w*  E[r(m)rr(m)]w0 
= 1 - 2w*p  + ^w*AB2Atw„  (3.40) 

It  is  easy  to  show  that  E[e2(m)]  = E[e2(m)]  and 

E[e2(m)  + e2(m)]  = 2 Re{E[e2(m)]}  = 2 - 4p*RT1p  + Re{w*AB2ATw0}  (3.41) 
We  can  now  write  Jr  as 

Jr  — ^[4  - 6p*R~1p  + Re{w*AB2ATw0}] 


(3.42) 


32 


Finally,  we  note  that 


mu  = E[Re{w*r(m)}  | di(m)  = 1] 

= Re{w*ci  exp(j^)}  = Re{w*p}  = p*R_1p  (3.43) 


By  combining  (3.37),  (3.42),  and  (3.43),  we  can  express  the  effective  SNR  for  the 
MMSE  receiver  as 

m2u  _ 4(p*Rr1p)2 

~ 2p*R_1p  - 4(p*R_1p)2  + Re{w*AB2Arw0}  ^3’44^ 


3.5.2  Asymptotic  Properties  of  the  MMSE  Receiver 

We  will  now  continue  and  prove  two  desirable  properties  of  the  MMSE  receiver; 
the  receiver  degenerates  to  the  matched  filter,  when  the  interference  disappear,  and  to 
the  decorrelating  receiver,  when  noise  disappear.  In  other  words,  the  MMSE  receiver 
will  be  the  optimum  receiver  for  both  of  these  extreme  situations. 

The  following  theorem  shows  that  if  the  interference  is  set  to  zero,  the  MMSE 
receiver  degenerates  to  the  matched  filter,  i.e.,  w0  = Cexp(jdi)ci,  where  C is  an 
arbitrary  nonzero  real  constant. 


Theorem  3.1  Suppose  that  a > 0.  If  c^a2fc_i  = c^a2fc  = 0 or  K = 1,  then  the 
MMSE  receiver  vector  is 


w 


O 


exp  (j6>!) 

«7^  + ||c1|PC1 


(3.45) 


Proof:  Recall  that  the  correlation  matrix  can  be  written  as 

K p 

qn  = o2IqN  + cic*  + ^2  -^-(a.2k-\a*2k-i  + a2k^2k)  (3.46) 

k= 2 Z 


R = ASA*  + cr2I( 
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Consider 


K p 

Rci  = 0-2C!  + CjC*C!  + ]T  -~{^2k-l^*2k-l  + a2yta2fc)c1  = (<72  + ||Ci||2)Ci  (3.47) 

fc= 2 Z 

Since  Ci  ^ 0,  Ci  is  an  eigenvector  of  R with  eigenvalue  Ai  = cr2  + ||ci||2.  Thus, 

_ ( expijOi)  \ 

R ( — ^ — -Ci  J = exp (j 0i ) Ci  = p (3.48) 

Hence,  exp(_)0i)ci/Ai  satisfies  the  Wiener-Hopf  equations  (3.32),  and  since  R is  in- 
vertible, the  MMSE  receiver  vector  is  unique  and  the  theorem  follows.  ■ 

To  show  that  the  MMSE  receiver  degenerates  to  the  decorrelating  receiver  as 
the  noise  goes  to  zero  is  a little  bit  more  involved.  The  reason  for  this  is  that  the 
correlation  matrix  is  not  invertible  for  a = 0,  and  therefore  the  solution  to  the  Wiener- 
Hopf  equations  (3.32)  is  not  unique. 


Theorem  3.2  Consider  R and  J(w)  as  given  by  (2.24)  and  (3.30),  respectively. 


Suppose  Ci  0 span{a3,  a4, . . . , a2ft-}.  Then,  as  a — > 0,  the  w0  £ CL  that  minimizes 
J( w)  is  such  that 


i - exp(j0i) 

hmw,,  = —..■■■■  u 
o-— 10  u 2 


(3.49) 


where  u is  the  orthogonal  projection  of  Ci  onto  (span{a3, . . . , a2i^}) J_ 
Proof:  The  vector  w0  satisfy  the  Wiener-Hopf  equations  Rw  = p where 


R — A S' A'  + CiC*  + ct2Iqiv  (3.50) 

and  where  A'  = [ a3  • • • a2K  ],  S'  = i diag(P2,  P2, . . . , P3)  P3),  and  p = exp(j0i)ci. 
If  QN  > 2K  - 1,  then  R is  invertible  if  and  only  if  a > 0.  For  cr  = 0 the  Wiener-Hopf 
equations  are 

0 = Rw  — p = A'S'A'*w  + cic^w  - exp(j0i)ci  (3.51) 

The  columns  of  A'  and  cx  are  linearly  independent,  and  S'  has  full  rank.  Hence, 


A'*w  = 0 £=>  w 6 null(A'*)  = [range(A')]_L 


(3.52) 
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cjw  - exp (jdi)  = 0 (3.53) 

Recall  the  eigenvalue  decomposition  of  R (2.29), 

R = E,(A'  + (72I2tf_i)E*  + tr2EnE;  (3.54) 

Since  Ci  £ range(Es)  it  follows  that  E*cq  = 0,  and  when  a > 0,  the  solution  to  the 
Wiener-Hopf  equations  is 

w0  = R_1p  = exp(j'01)Es(A/  + cr2I2x-1)-1E*c1  (3.55) 

Clearly, 

lim  w0  = exp(j01)EsA,_1E*c1  (3.56) 

From  this  equation  we  observe  that  linv_>0wo  £ range(Es),  and  since  range(Es)  = 
range(A)  = rangeQcq  A']),  it  follows  that 

range(Es)  = range(A')  + range(u)  (3.57) 

where  u is  the  orthogonal  projection  of  Ci  onto  [range(A,)]-L.  Recall  that,  by  assump- 
tion, u/0.  Now,  since  limff_>0  w0  £ range(Es)  and  Iimff_>o  wc  £ [range  (A')]1-,  it  is 
clear  that  lim^o  w0  £ range(u).  That  is,  for  some  complex  scalar  a, 

limw0  = cm  (3.58) 

Consider  (3.53), 

exp(j0i)  = aCjU  = a||u||2  =>  a = (3.59) 

and  the  theorem  follows.  ■ 

We  are  now  ready  to  derive  an  expression  for  the  asymptotic  efficiency  for  the 
MMSE  receiver.  As  it  turns  out,  the  asymptotic  efficiency  is  independent  of  the 
interfering  users’  received  energies;  therefore,  for  the  MMSE  receiver,  asymptotic 
efficiency  and  near-far  resistance  are  equivalent. 
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Theorem  3.3  The  near-far  resistance  for  a linear  single-bit  single-user  receiver  will 
be  zero  if  C\  £ span{a3, . . . , a^}.  If  Ci  0 span{a3, . . . , a2/<-},  then  the  asymptotic 
efficiency  for  the  MMSE  receiver  will  be 


m = 


where  w0  is  given  by 


1 

QN||w0| 


exp(jfli) 

Hull2 


u 


(3.60) 


(3.61) 


and  where  u is  the  orthogonal  projection  of  Ci  onto  (span{a3, . . . , a 2k})~ 
Proof:  The  soft  decision  can  be  written  as 


y\(m)  = d1(m)exp(^1)w*ci  + w*n(m) 

K 

+ exp(j^)(z2fc-i(m)w>2fc_1  + z2k{m) w*a2fc)  (3.62) 


As  the  noise  disappears  and  the  interfering  users’  received  powers  goes  to  infinity, 
the  soft  decision  will  be  dominated  by  the  interference  term  unless  w0  is  orthogonal 
to  {a3, . . . , a 2k}-  However,  if  Ci  £ span{a3, . . . , a2^},  then  w0  will  be  orthogonal  to 
Ci  as  well,  and  the  soft  decision  becomes  zero.  Clearly,  this  means  that  the  near-far 
resistance  of  the  receiver  is  zero.  Conversely,  if  Cx  0 span{a3, . . . , a2Ar},  then  the 
receiver  may  have  a nonzero  near-far  resistance. 

Suppose  Ci  0 span{a3, . . . , a 2k},  then  Theorem  3.2  says  that  as  a — >•  0, 


exp(j0i) 


-u 


u 


(3.63) 


where  u the  orthogonal  projection  of  cx  onto  (span{a3, . . . , a2ft-})-L.  Hence, 


Vi  (m)  = exp  (j6i)di(m)  = dx(m ) (3.64) 

Note  that  yi(m)  = w’ABz(m).  Thus,  the  argument  to  the  Q-functions  in  the  ex- 
pression for  the  error  probability,  (3.8),  is  seen  to  be  independent  of  d(m).  Thus,  the 
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error  probability  can  be  written  as 


Pe(N0/2)  = Q 


( l2E M 1 ^ 

VV  No  Q7V||w0||2j 


(3.65) 


since  a2  = QNN0/Ebti . From  the  definition  of  asymptotic  efficiency,  (3.22),  it  follows 
that 


Vl  Wlwoll2  (3‘66) 

Furthermore,  since  r\ \ is  independent  of  the  interfering  users’  received  energies,  the 
asymptotic  efficiency  and  the  near-far  resistance  are  equivalent.  ■ 


3.6  Dimension  Reduction 

The  MMSE  receiver  can  be  implemented  as  a QiV-tap  finite  impulse  response 
(FIR)  filter,  as  shown  in  Figure  3.3.  The  FIR  filter  is  fed  the  received  sequence,  (2.3), 
from  the  integrate-and-dump  sections,  and  the  filter  is  sampled  at  the  symbol  rate  to 
form  the  filter  output  yi(m). 


Figure  3.3:  FIR  implementation  of  the  MMSE  receiver. 


It  is  desirable  to  reduce  the  length  of  the  adaptive  filter  for  several  reasons.  The 
computational  burden  on  the  adaptive  algorithm  will  be  reduced.  The  numerical 
stability  is  also  increased,  and  the  excess  mean  square  error,  i.e. , the  extra  mean 
squared  error  added  to  Jmin  by  the  adaptation  process,  is  decreased  [15].  To  reduce 
the  filter  length,  we  introduce  a preprocessing  stage  in  the  form  of  a QN  x L complex 
matrix  V,  as  shown  in  Figure  3.4.  The  received  vector  is  premultiplied  by  V*,  which 
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produces  an  L-vector.  In  other  words,  the  dimensionality  of  the  received  vector  is 
reduced  from  QN  to  L.  We  assume  that  L < QN  and  that  V has  full  rank. 


Figure  3.4:  Reduced  dimension  MMSE  receiver. 

We  denote  the  reduced  dimensionality  received  vector  with  rv(m)  E <CL ; 

r v{m)  — V*r(m)  = V*ABz(m)  + V*n(m)  - AyBz(m)  + ny(m)  (3.67) 

where 

Ay  = V*A  = [ayti  •••  ay,2^]E(DLx2K  (3.68) 

ny(m)  = V*n(m)  G (CL  (3.69) 

The  correlation  matrix  for  rv(m)  is 

Ry  = E[ry(m)ry(m)]  = AySA^  + a2V*V  (3.70) 

The  results  obtained  in  Section  3.5  are  still  valid  for  the  reduced  dimension  case 
if  we  make  the  appropriate  substitutions  and  assumptions.  The  reduced  dimension 
MMSE  receiver  vector,  wGiy  E CL,  is  found  as  the  solution  to  the  Wiener-Hopf 
equations 

Rvwo,v  = Pv  (3-71) 

where  py  E (DL  is  defined  by 


py  = E[d1(m)ry(m)]  = V*p  = V*Ci  exp(j'6>i) 


(3.72) 
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The  error  probability  is 

p 1 v ^ ^Re{w* vAyBz(m)}^ 

22K~'  d,.,^  I VllVw*vl|V/2  J 

d,  i (m)— 1 

We  are  naturally  interested  in  finding  the  optimum  dimension  reduction  trans- 
form. The  optimum  transform  is  defined  as  the  transform  that  conserves  the  error 
probability,  i.e. , the  transform  V0  for  which  the  MMSE  receiver  based  on  V*r(m) 
has  the  same  error  probability  as  the  MMSE  receiver  based  on  r(m).  The  following 
lemma  and  theorem  will  give  sufficient  conditions  for  a transform  to  be  optimum. 


Lemma  3.4  Suppose  that  Es,  R,  ancTRy  are  defined  by  (2.29),  (2.24),  an d (3-70), 
respectively,  and  that  a > 0.  Furthermore,  suppose  that  V G (CLx<^iV  is  such  that 
V*V  = IL,  and  range(Es)  C range(V).  Then  for  x,  y G range(Es)  and  an  integer  p, 

x*(R-1)py  = (V*x)*(Ry1)p(V*y)  (3.74) 


Proof:  Note  that  R and  R\/  are  invertible  if  a > 0 and  that  V has  full  rank. 
According  to  (2.37),  we  can  write  (RyX)p  as 


(Ry1)P  = [ V*ES  Eny]A-p[V*Es  E n,v]*  (3.75) 


where 


0 

A n<v  _ 


Let  u G range(Es),  then  Theorem  2.2  allows  us  to  write 


(3.76) 


(V*Es)*(V*u)  = E*VV*u  = E*u  (3.77) 

and  since,  according  to  Theorems  2.3  and  2.1,  range(V*Es)  = range(V*A)  and  since 
V*u  G range(V*Es), 


K.  v(V*u)  = 0 


(3.78) 
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Hence,  for  x,  y G range(Es), 


(V*x)*(Ry1)p(V*y)  = (V*x)*  [V*ES  Eny]A^[V%  En,,]*(V*y) 


r(V*Ea)*(V*y)l 

= [ (V*x)*V*Es  (V*x)*En,K]A^ 

L K,v(V*y)  J 


= [(E lx)*  0]  A~p 

0 


(3.79) 


Since  E*x  = E*y  = 0 and  by  virtue  of  (2.29),  we  can  write  (3.79)  as 


(V*x)*(Ry1)p(V*y)  = x*  [Es  En]  A#P  ° [E.  En  ]*  y 

L 0 A-»J 


= x*(R_1)py 


(3.80) 


and  the  lemma  follows. 


Now  we  are  ready  to  formulate  and  prove  the  form  of  the  optimum  dimension 
reduction  transform. 

Theorem  3.5  Suppose  a2  > 0.  If  V e (DQ NxL  is  such  that  range(Es)  C range(V) 
and  V*V  = lL,  then  the  MMSE  receiver  with  dimension  reduction  transform  V will 
have  the  same  error  probability  as  the  MMSE  receiver  without  dimension  reduction. 
Proof:  Recall  that  the  probability  of  error  for  an  MMSE  receiver  with  dimension 
reduction  transform  V can  be  written  as 


where  w ay  — Ry'pv,  Ry  = V*RV,  = V*p,  and  p = exp(j^1)ci.  For  the  MMSE 
receiver  without  dimension  reduction,  the  error  probability  is 


(3.81) 


(3.82) 


40 


In  order  to  prove  the  theorem,  it  is  therefore  sufficient  to  show  that 

R,e{w*^V*  ABz(ttt)  } = Re{w*ABz(m)}  (3.83) 

||Vwoy||2  = ||w0||2  (3.84) 

Consider  w*vV*ABz(m)  and  recall  that  R y1  is  symmetric; 

woyV*ABz(m)  = Pv(Rv1)*V*ABz(m)  = (V*p)*R^(V*ABz(m))  (3.85) 

Note  that  p G range(Es)  and  ABz(m)  G range(Es).  According  to  Lemma  3.4  this 
means  that 


(V*p)*Rv1(V*ABz(m))  = p*R  1ABz(m)  = w’ABz(m)  (3.86) 

and  it  is  seen  that  (3.83)  is  satisfied.  Finally,  since  V*V  = IL, 

l|Vw„y||2  = IIVR^Vll2  = ||VR^Vp||2 

= p'VI^'rVVH^Vp  = (Vp)*R^2(V*p)  (3.87) 

Now,  since  p G range(Es),  we  can  use  Lemma  3.4  again  and 

||Vw0)v||2  =3  (V*p)*R^2(V*p)  = p*R-2p  = ||W0||2  (3.88) 

Thus,  (3.84)  is  satisfied,  and  the  theorem  follows.  ■ 

From  this  theorem  we  conclude  that  the  optimum  dimension  reduction  transform 
is  not  unique  and  requires  knowledge  of  the  signal  subspace  range(Es)  = range(A). 
Hence,  we  cannot  compute  the  optimum  transform  in  practice,  since  A is  a function 
the  unknown  propagation  delays,  see  (2.12)  and  (2.13).  Furthermore,  the  smallest 
L we  can  use  is  equal  to  the  dimensionality  of  the  signal  subspace;  in  other  words, 
L > rank(A).  If  we  assume  that  Tj  = 0,  then  L > 2 K — 1.  However,  if  we  know  w0, 
i.e.,  the  receiver  vector  for  the  MMSE  receiver  without  dimension  reduction,  we  may 
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use  the  transform  V — w0.  Clearly,  if  — C , where  C is  an  arbitrary  real  nonzero 
constant,  then  the  resulting  receiver  is  equivalent  to  the  full  length  receiver.  Thus, 
by  definition,  the  dimension  reduction  transform  V = w0  is  optimum.  However,  this 
transform  violates  the  conditions  of  Theorem  3.5.  With  this  in  mind,  it  seems  like 
the  optimum  transform  according  to  Theorem  3.5  is  of  limited  value.  On  the  other 
hand,  if  we  would  like  to  implement  a multiuser  detector,  the  optimum  transformation 
matrix  will  be  optimum  for  all  users  in  the  system  and  furthermore  w0  is  of  course 
unknown. 

3.6.1  Cyclically  Shifted  Filter  Bank 

Madhow  and  Honig  have  proposed  several  reduced  dimension  MMSE  receivers  [25]. 
The  authors  were  mainly  concerned  with  the  near-far  problem,  and  the  structures  were 
therefore  compared  in  terms  of  the  asymptotic  efficiency.  The  best  receiver  among  the 
ones  considered  is  the  so-called  cyclically  shifted  filter  bank  (CSFB).  A block  diagram 
for  the  CSFB  receiver  is  shown  in  Figure  3.5 


> 


VcSFBr(m) 


Figure  3.5:  Block  diagram  of  the  cyclically  shifted  filter  bank  receiver. 

The  elements  of  V£SFBr(m)  are  found  as  the  output  of  a bank  of  L filters,  where 
each  filter  is  matched  to  the  desired  vector  cyclically  shifted  with  A = QN/L.  We 
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assume,  for  convenience,  that  A is  an  integer.  The  CSFB  dimension  reduction  trans- 
form, V CSFB  £ $QNxL,  can  be  written  as 


where  c1>m  denotes  Ci  cyclically  shifted  m steps.  It  is  seen  from  (3.89)  that  Vcsfb 
cannot  be  optimum  since  all  cyclical  shifts  of  c:  are  assumed  to  be  linearly  indepen- 
dent of  {a3,  a4, . . . , a2K}-  Thus,  aj  £ range(VCsFB)  for  any  j = 3,4,...,  2 K,  and  the 
signal  subspace  will  not  be  contained  in  range(VCsFB)- 

3.6.2  Gamma  and  Modified  Gamma 

The  gamma  filter  MMSE  receiver  is  depicted  in  Figure  3.6.  The  filter  input  is  the 
received  sequence  r(l)  = rf(l)  + jrQ(l),  as  defined  by  (2.3).  The  gamma  filter  is  a 
generalized  feed-forward  filter  found  by  replacing  the  delay  elements  in  an  ordinary 
tapped  delay  line  with  leaky  integrators,  or  gamma  kernels  [35].  The  gamma  kernel 
is  controlled  by  an  adaptive  parameter  fi.  The  transfer  function,  G(z),  for  a gamma 
kernel  is 


The  kernel  has  a single  pole  at  1 - p and  is  therefore  stable  for  fi  e (0,2).  It  can 
be  shown  that  p and  the  weights  wi,  w2, . . . ,Wl  can  be  adapted  simultaneously  such 
that  the  MSE  is  minimized  [35]. 

The  effect  of  the  gamma  filter  is  to  perform  dimension  reduction  and  adaptation 
of  the  receiver  vector  simultaneously.  We  assume  that  the  filter  is  cleared  after  each 
sampling  of  the  filter.  Thus,  a filter  cycle  is  as  follows:  the  tap  values  are  set  to  zero, 
QN  samples  of  the  received  sequence  are  propagated  through  the  cascaded  gamma 
kernels,  the  resulting  tap  values  are  multiplied  with  the  tap  weights,  and  finally  the 
filter  is  sampled. 


Vcsfb  = [ Ci  cliA  •••  Ci^-a] 


(3.89) 


for  0 < fj,  < 1 


(3.90) 
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Figure  3.6:  Block  diagram  of  gamma  and  modified  gamma  receiver. 

The  dimension  reduction  transform  is  denoted  by  Vr  G IRq;VxL,  and  the  (i,j)th 


element  of  Vp  is 

vr  (i,j)  = 

where  gk(n)  is  the  impulse  response  from  the  input  to  the  kth  tap 

(n  — 1N 


(3.91) 


9k{n)  = < 


£*(!-//)"“*  for  n>k 


— 1/  - (3.92) 

0 for  n < k 

As  seen  from  (3.92),  the  dimension  reduction  is  adaptive  in  nature.  However, 
since  Vr  is  controlled  by  only  one  parameter  //,  the  form  of  the  dimension  reduction 
transform  is  limited.  For  instance,  it  is  easy  to  see  that  Vp  cannot  be  optimum  in 
the  sense  of  Theorem  3.5.  Nevertheless,  we  expect  the  gamma  receiver  to  perform 
better  than  the  CSFB  when  the  system  is  interference  dominated.  This  is  since  we 
have  more  freedom  in  the  choice  of  dimension  reduction  transform  in  the  gamma  filter 
case.  Conversely,  we  expect  that  the  CSFB  receiver  will  work  better  than  the  gamma 
receiver  in  a noise  dominated  scenario,  since  the  gamma  filter — as  opposed  to  the 
cyclically  shifted  filter  bank — can  never  be  a matched  filter.  These  heuristics  were 
verified  by  the  numerical  results  presented  by  Strom  and  Miller  [47]. 

In  an  attempt  to  improve  the  performance  of  the  gamma  receiver,  the  structure 
was  modified  to  include  a tap  that  corresponds  to  the  output  of  the  matched  filter. 
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The  modified  gamma  receiver  is  depicted  at  the  bottom  of  Figure  3.6,  and  the  resulting 
dimension  reduction  transform,  denoted  by  Vmr  e ]RQAr  xL,  is 


where  Vr,L-i  is  the  gamma  transform  with  L — 1 taps. 

The  performance  of  the  modified  gamma  receiver  has  been  experimentally  shown 
to  be  similar  to  the  gamma  receiver  in  a heavily  loaded  system  and  marginally  better 
in  a lighter  loaded  system  [47]. 

3.6.3  Symmetric  Dimension  Reduction 

The  symmetric  dimension  reduction  (SDR)  scheme  forms  the  elements  of  the 
reduced  received  vector  as  partial  sums  of  the  matched  filter  output.  As  seen  from 
the  block  diagram  in  Figure  3.7,  there  are  A = QN/L  taps  in  each  partial  filter  (A 
is  again  assumed  to  be  an  integer).  If  we  let  wi  = w2  = • • • = wL  - exp then 
the  SDR  receiver  degenerates  to  the  matched  filter.  Since  the  SDR  has  this  property, 
it  is  expected  that  the  symmetric  dimension  reduction  receiver  will  perform  similarly 
to  the  CSFB  in  a noise  dominated  scenario.  We  also  note  that  the  SDR  receiver 
requires  less  computational  complexity  in  the  dimension  reduction  stage  than  the 
CSFB  receiver. 

The  dimension  reduction  transform,  VSdr  £ IR<3Arxi,  is  found  as 


Vmr  — [Vr,L_i  Ci.  ] 


(3.93) 


VsDR  — [Ci,i;A  C1,A+1:2A  "•  Cl,(L-l)A+l:QN  ] 


(3.94) 


where  Cijr:p  G IR^,  for  p > r,  denotes  the  vector 


Ci,r:p  = [0  •••  0 cx(r)  ci(r  + l)  •••  c i(p)  0 ■■■  0]T 


(3.95) 


r — 1 zeros 
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VsDRr(m) 


Figure  3.7:  Block  diagram  of  symmetric  dimension  reduction  receiver. 

3.6.4  Eigenvector  Identification 

The  optimum  transform  cannot  be  computed  directly  in  practice  since  the  signal 
subspace,  range(Es),  is  unknown.  However,  we  can  certainly  estimate  range(Es). 
Recall  that  Es  is  a matrix  whose  columns  are  the  eigenvectors  corresponding  to  the 
2K  -l  largest  eigenvalues  of  the  correlation  matrix  R.  Now,  since  R is  symmetric, 
Es  can  be  found  as  the  2 K - 1 first  left  singular  vectors  of  R.  It  is  easy  to  see  that 
V = E,  satisfies  the  conditions  for  the  optimum  transform  according  to  Theorem  3.5. 
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We  can  estimate  R = E[r(m)r*(ra)]  with  the  sample  correlation  matrix 


Y m 

rm  = ^E  r(m)r*(m)  = UEW’ 


m=  1 


(3.96) 


where  USW*  is  the  singular  value  decomposition  (SVD)  of  RM.  Es  is  now  estimated 
by  the  2 K — 1 first  columns  of  U, 


VeGID  = [Ui  U2  •••  U.2K—1  ] 


(3.97) 


This  transform  is  called  the  eigenvector  identification  (EGID)  dimension  reduction 
transform. 

The  quality  of  the  estimate  increases  with  M;  as  matter  of  fact,  it  can  be  shown 
that  Vegid  is  a consistent  estimate  of  Es,  that  is,  as  M ->■  oo,  then  VEgid  — > Es.  To 
compute  Vegid  , we  can  of  course  use  an  ordinary  eigenvalue  decomposition  instead 
of  an  SVD. 

3.6.5  Numerical  Results 

In  this  section  we  will  present  some  results  from  Monte-Carlo  simulations  of  differ- 
ent reduced  dimension  MMSE  receivers.  We  will  consider  two  performance  measures: 
asymptotic  efficiency  and  probability  of  bit  error.  As  mentioned  above,  the  asymp- 
totic efficiency  and  the  near-far  resistance  are  equivalent  for  an  MMSE  receiver.  The 
asymptotic  efficiency  is  a number  between  zero  and  one.  A zero  near-far  resistance 
implies  that  we  will  have  an  irreducible  error  as  the  noise  power  disappears.  That 
is,  we  cannot  reduce  the  error  probability  to  an  arbitrary  level  by  increasing  the 
signal-to-noise  ratio.  On  the  other  hand,  a nonzero  near-far  resistance  implies  that 
the  error  probability  decreases  exponentially  with  the  signal-to-noise  ratio  when  the 
noise  power  goes  to  zero.  It  is  important  to  realize  that  a better  near-far  resistance 
does  not  imply  a lower  bit  error  probability  in  a noisy  scenario.  For  instance,  during 
heavy  loading,  that  is,  when  L ~ 2 K,  the  gamma  and  modified  gamma  receivers  have 


47 


5000 

4500 

4000 

3500 

3000 

2500 

2000 

1500 

1000 

500 

0 


Full-length  FIR 


mean:  0.6875 
variance:  0.0110 


L. 


5000  r 
4500  - 
4000  - 
3500  - 
3000  - 
2500  - 
2000  - 
1500  - 
1000  - 
500  - 


0 0.1  0.2  0.3  0.4  0.5  0.6  0.7  0.8  0.9  1 

m 

Symmetric  Dimension  Reduction 

~ i 1 1 1 

mean:  0.1897 
variance:  0.0205 


BBllBltollhm-. 


0 0.1  0.2  0.3  0.4  0.5  0.6  0.7  0.8  0.9  1 

m 


5000  - 
4500  - 
4000  - 
3500  - 
3000  - 
2500  - 
2000  - 
1500  - 
1000  - 
500  - 
0 - 


Eigenvector  Identification,  M = 100 


- i r- 


mean:  0.3838 
variance:  0.0290 


0 0.1  0.2  0.3  0.4  0.5  0.6  0.7  0.8  0.9  1 

m 

Cyclically  Shifted  Filter  Bank 


Figure  3.8:  Histograms  of  the  near-far  resistance  for  100,000  Monte-Carlo  runs.  The 
mean  and  variance  indicated  are  computed  as  the  sample  mean  and  sample  variance, 
respectively. 


a larger  near-far  resistance  than  the  cyclically  shifted  filter  bank  [47].  However,  in  a 
less  loaded  system  or  when  we  have  a nonzero  noise  power,  the  cyclically  shifted  filter 
bank  will  outperform  the  gamma  structures  [47]. 

The  simulated  system  was  a K = 5 user  system  with  Q — 1 and  N = 31  chips 
per  bit  Gold  sequences  [39].  The  dimension  reduction  was  done  by  an  IV  x L matrix 
where  L = 10.  This  corresponds  to  rather  heavily  loaded  system.  In  the  bit  error 
experiment,  the  signal-to-noise  ratio,  2 Eb/N0,  was  set  to  10  dB  for  all  users  (i.e., 
perfect  power  control  was  assumed).  To  compute  the  EGID  transform,  according 
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Figure  3.9:  Histograms  of  the  error  probability  for  100,000  Monte-Carlo  runs.  The 
mean  and  variance  indicated  are  in  linear  units  and  are  computed  as  the  sample  mean 
and  sample  variance,  respectively. 


to  (3.97),  we  need  to  observe  the  received  vector  for  M bit  intervals.  In  the  simula- 
tions M = 100  was  used.  Independent  realizations  of  the  propagation  delays,  data 
sequences,  received  phases,  and  noise  vectors  were  drawn  for  each  Monte-Carlo  run. 
A total  of  105  Monte-Carlo  runs  were  done  per  histogram. 

The  plots  presented  in  Figure  3.8  are  histograms  of  the  near-far  resistance  of 
the  full-length  receiver,  cyclically  shifted  filter  bank,  eigenvector  identification,  and 
symmetric  dimension  reduction  transforms.  In  Figure  3.9,  histograms  of  the  error 
probability  for  the  same  dimension  reduction  schemes  are  shown.  The  error  proba- 
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bility  was  computed  using  the  Gaussian  approximation  (3.20). 

As  seen  from  these  plots,  the  full-length  receiver  is  superior  to  the  reduced  di- 
mension structures  in  both  near-far  resistance  and  error  probability.  The  symmetric 
dimension  reduction  scheme  is  slightly  better  than  the  cyclically  shifted  filter  bank. 
The  EGID  transform  is  clearly  better  than  both  the  SDR  and  CSFB  structures.  For 
more  extensive  conclusions,  see  Section  3.8. 

3.7  Linear  Conjugate  Linear  MMSE  Receiver 

During  the  time  this  chapter  was  compiled,  the  author  was  made  aware  of  a very 
simple  modification  to  the  MMSE  receiver  that  achieves  better  performance  with 
lower  computational  complexity  [3].  The  modification  is  the  introduction  of  a nonlin- 
earity, and  thus  the  modified  receiver  does  not  belong  to  the  class  of  linear  receivers 
analyzed  in  this  chapter.  Nevertheless,  this  section  was  included  since  the  modifi- 
cation is  remarkably  simple  and  it  is  believed  that  many  of  the  results  in  presented 
previously  in  this  chapter  will  hold  for  the  modified  receiver  as  well. 

The  underlying  theory  of  this  modification  was  developed  by  Brown  and  Crane 
as  early  as  1969  [7].  Aue  and  Reed  named  the  modified  receiver  the  reduced  linear- 
conjugate-linear  Fourier  series  representation  sequence  estimator  (reduced  LCL-FS- 
SE)  [3].  For  brevity  and  consistency  with  the  notation  in  this  dissertation,  we  choose 
to  call  the  modified  receiver  the  linear  conjugate  linear  MMSE  (LCL-MMSE)  receiver. 

3.7.1  Linear  Conjugate  Linear  Filtering 

If  the  correlation  matrix  E[r(ra)rT(m)]  is  nonzero,  then  the  introduction  of  a con- 
jugate branch  into  the  MMSE  filter,  as  depicted  in  Figure  3.10,  will  increase  the  per- 
formance in  the  sense  that  the  resulting  MSE  is  lower  than  Jmin  as  given  by  (3.34)  [7]. 
Note  that  conjugation  is  a nonlinear  operation. 

We  define  the  complex  correlation  matrix  as  Rc  = E[r(  m)rT(m)].  It  easy  to  show, 
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Figure  3.10:  Block  diagram  of  the  linear  conjugate  linear  MMSE  receiver, 
since  E[n(m)nT(m)]  = 0 and  B is  diagonal,  that 

Rc  = E[r(m)rT (m)\  = AB  E[z(ra)zr(ra)]BrAr  = iAB2AT  (3.98) 

2 

R.c  is  clearly  nonzero,  and  we  therefore  expect  that  the  LCL-MMSE  filter  will  perform 
better  than  the  MMSE  filter. 

The  error  signal  is  formed  as 

r(m) 


e(m)  = ch  (m)  - w*0  0r (m)  - w;cr (m)  = dx  (m)  - [ w*  „ w*  c ] (3.99) 

Lr(m) 

We  can  therefore  view  the  LCL-MMSE  filter  as  an  augmented  filter  with  filter  vector 
wa  = [w^o  wo,c]T  and  filter  input  ra(m)  = [rr(m)  rT(m)]T.  The  input  can  be 
written  as 

ra(m)  = AaBaza(m)  + n a(m)  (3.100) 


where 


z a(m) 


A 0 

0 A 
B 0 

0 B 

z(m)  ’ 

z(m)  _ 


e jp^2QATx4/<' 


e C 


iKxAK 


G {-1,0,1} 


AK 


51 


na(m) 


n (m) 
n (m) 


G C2<?7V 


It  is  straightforward  to  find  the  augmented  filter  vector  as  wa  = where 


Ra  = E[ra(m)r*(m)] 
pa  = E[di(m)ra(m)] 


R 

Rc 

R, 

R 

P 

P_ 

(3.101) 

(3.102) 


and  where  R and  p is  the  correlation  matrix  and  crosscorrelation  vector  defined 
by  (2.24)  and  (3.31),  respectively. 


The  complexity  of  the  MMSE  receiver,  depicted  in  Figure  3.10,  is  roughly  twice 
that  of  the  ordinary  MMSE,  illustrated  in  Figure  3.2.  Thus,  whatever  performance 
gain  the  LCL-MMSE  receiver  will  give  us  might  be  offset  by  its  increased  complexity. 
However,  we  will  show  below  that  the  LCL-MMSE  receiver  can  be  implemented  with 
two  real  filters,  rather  than  with  two  complex  filters.  The  resulting  complexity  will 
actually  be  less  than  the  ordinary  MMSE  receiver,  since  one  complex  multiplication 
usually  requires  more  complexity  than  two  real  multiplies;  therefore,  one  complex 
filter  requires  more  complexity  than  two  real  filters. 

Now  consider  the  equation  pa  = Rawa 


P 

R 

Rc 

’w0>0' 

.P. 

Rc 

R 

. W0,c . 

which  can  be  rewritten  as 


(3.103) 


p' 

R 

Rc 

’ w0iC ' 

’ w0;C ' 

— 

C3 

K 

II 

-P. 

o 

i 

R 

„W0i0 

,w0)0_ 

(3.104) 


since  R = R.  Hence,  since  Rawa  = pa, 


’w0)0‘ 

'w0)C' 

- wo,c . 

W¥. 

(3.105) 
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or  wo,o  = w0)C.  The  error  signal  may  therefore  be  written  as 


e(m)  = (him)  - w*0r(m)  - w*0r (m) 

= Mm)  -2^{w*„r(m)} 

= di  (m)  - 2 [ReT{w0;0}  Re{r(m)}  - ImT{w0)0}  Im{r(m)}]  (3.106) 

In  other  words,  the  LCL-MMSE  receiver  can  be  implemented  as  shown  in  Figure  3.11, 
where  wo  r = 2 Re{w0O}  and  w0;j  = —2  Im{w0  0}.  Note  that  this  implementation  only 
uses  real  filters,  and  consequently  the  computational  complexity  is  reduced  compared 
with  the  ordinary  MMSE  receiver  (which  requires  complex  filters). 


Figure  3.11:  LCL-MMSE  receiver  using  only  real  filters. 


3.7.2  MMSE  Receiver  with  A Priori  Phase  Knowledge 

Suppose  we  know  the  desired  user’s  phase,  6X.  A natural  modification  to  the 
MMSE  receiver  is  then  to  remove  the  desired  user’s  phase  from  the  received  vectors 
and  process  only  the  real  part  of  the  resulting  vectors,  as  depicted  in  Figure  3.12. 
This  receiver  is  called  the  real  MMSE  receiver. 

The  input  to  the  filter  is 

rr(m)  = Re{r(m)  exp(-j6h)}  = ABrz(m)  + n r(m)  (3.107) 
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Figure  3.12:  MMSE  receiver  using  only  real  filtering, 
where  n T(m)  = Re{n(m)  exp(—  jdi)}  is  a real  Gaussian  zero-mean  random  vector 
with  correlation  matrix 

2 

E[nr(m)n^(m)]  = Y1QN  (3.108) 

and  Br  e IR2**2*  is 

Br  = Re{B  exp(—  j0\)}  (3.109) 

The  correlation  matrix  is 

Rr  = E[rr(ra)if(m)]  = ^[AB2Ar  + a2lQN\  (3.110) 

and  the  crosscorrelation  vector  is 

pr  = E[d1(m)rr(ra)]  = Ci  (3.111) 

It  is  well-known  that  the  minimum  MSE  is  achieved  by  using  the  receiver  vector 
w0irr  = R“'pr,  and  that  the  minimum  MSE  is 

Jr  = 1 - P^R^Pr  (3.112) 

The  decision  variable  is  simply  the  output  of  the  filter,  and  thus  the  mean  is 

mu  = E[wJrrrr(m)  | di  (m)  = 1]  = wjrrd  = wjrrpr  (3.113) 

The  effective  SNR  for  the  real  receiver  can  now  be  found  as 

= 1 

a2  JT  (1  mu)2  l/pJ"R~1pr  - 1 


(3.114) 
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3.7.3  Simulation  of  LCL-MMSE  and  MMSE  Receivers. 

We  refer  to  the  LCL-MMSE  receiver  defined  by  Figure  3.10  as  type  I and  the  LCL- 
MMSE  receiver  defined  by  Figure  3.11  as  type  II.  In  this  section  we  will  demonstrate 
experimentally  that  the  type  I and  type  II  LCL-MMSE  receivers  are  indeed  equivalent 
and  furthermore  compare  the  performances  of  the  LCL-MMSE,  MMSE,  and  real 
MMSE  receivers. 

The  simulated  system  was  a K = 5 user  system  with  iV  = 15  chips  per  bit  Gold 
sequences.  No  oversampling  was  done,  and  perfect  power  control  was  assumed,  i.e., 

Q = 1 and  7r  = 72  = • • • = 7*.  The  propagation  delays  and  phases  were  (randomly) 
selected  to  be 


t/Tc  = [ 0 2.1942  7.7568  3.8888  3.8003]r  (3.115) 

9 = [0.7251  2.8904  3.9915  6.1891  1.7940  }T  (3.116) 

For  each  Monte-Carlo  run,  independent  realizations  of  the  data  and  noise  wave- 
forms were  created.  The  resulting  realization  of  the  received  vector  sequence  was 
processed  by  the  receivers,  and  the  MSE  and  number  of  errors  were  recorded.  The 
error  probability  was  computed  by  the  exact  expression  (3.8)  and  by  the  Gaussian 
approximation  formula  (3.20).  The  results  of  106  Monte-Carlo  runs  are  summarized 
in  Table  3.1. 

It  is  evident  that  the  LCL-MMSE  type  I and  II  receivers  are  equivalent.  Further- 
more, there  is  a fair  agreement  between  the  simulated  and  theoretical  quantities,  and 
the  Gaussian  approximation  is  shown  to  be  reasonably  correct.  These  results  indicate 
that  the  LCL-MMSE  receiver  performs  better  than  the  real  MMSE  receiver,  which, 
in  turn,  is  slightly  better  than  the  MMSE  receiver. 

Plots  of  the  error  probability  as  a function  of  the  SNR,  2Eb/N0,  is  shown  in 
Figure  3.13.  The  error  probability  for  the  different  receiver  were  computed  in  three 
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Table  3.1:  Results  of  simulation  for  SNR  2 Eb/N0  = 10  dB. 


MMSE 

Real 

LCL-MMSE  I 

LCL-MMSE  II 

J 

Theor. 

0.2417 

0.1297 

0.1119 

0.1119 

Simul. 

0.2416 

0.1297 

0.1119 

0.1119 

Pe 

Theor. 

5.3385  x 10"3 

4.6819  x 10-3 

2.4139  x 10~3 

2.4139  x 10“3 

Gauss. 

5.4001  x 10-3 

4.7875  x 10~3 

2.4234  x 10~3 

2.4234  x 10~3 

Simul. 

5.3410  x 10~3 

4.6890  x 10~3 

2.3300  x 10~3 

2.3300  x 10~3 

ways:  analytically,  (o)  joined  by  solid  lines;  by  the  Gaussian  approximation,  (x) 
joined  by  dashed  lines;  and  by  simulation,  (*)  joined  by  dotted  lines.  The  curves  are 
from  the  top:  the  MMSE  receiver,  the  real  MMSE  receiver,  and  the  LCL-MMSE  I 
and  II  receivers  (overlayed).  For  reference,  the  single-user  system  error  probability, 
i.e.,  Q(y/2Eb/N0),  is  plotted  as  the  bottommost  curve.  As  seen  from  Figure  3.13, 
the  LCL-MMSE  receiver  performs  better  than  the  real  MMSE  receiver,  which,  in 
turn,  outperforms  the  MMSE  receiver.  These  results  can  heuristically  be  explained 
as  follows. 

Recall  that  the  error  signal  is  formed  as  the  difference  between  the  desired  user’s 
bit  and  the  filter  output,  i.e.,  e(m)  = di(m)  - yx(m).  Note  that  only  the  real  part 
of  the  error  signal  can  cause  the  receiver  to  make  a wrong  decision,  since  di(m)  = 
sgn[Re{yi(m)}].  Moreover,  the  complex  MMSE  receiver  minimizes  the  magnitude  of 
the  error,  and  the  real  MMSE  receiver  minimizes  the  real  part  of  the  error.  Thus,  the 
complex  receiver  will  generally  perform  worse  than  the  real  receiver. 

The  performance  difference  between  the  real  MMSE  and  LCL-MMSE  receivers  can 
be  explained  by  the  following  example.  Suppose  the  desired  user’s  phase  is  zero,  i.e., 
#i  = 0.  This  means  that  the  imaginary  branch  of  the  type  II  LCL-MMSE  receiver 
will  not  contain  any  information  about  the  desired  user.  Thus,  we  are  tempted 
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Figure  3.13:  Error  probability  as  a function  of  SNR 

to  delete  the  imaginary  branch  altogether.  This  will  be  a mistake,  however,  since 
plenty  of  information  regarding  the  interference  still  remains  in  this  branch.  Thus, 
by  processing  this  information  and  adding  it  to  the  soft  decision,  we  can  make  the 
receiver  more  reliable.  We  can  view  the  LCL-MMSE  receiver  as  a form  of  interference 
cancellation  on  top  of  a linear  receiver. 

3.8  Conclusions 

The  main  contributions  of  this  chapter  are  found  in  Sections  3.5  and  3.6. 

In  Section  3.5,  the  basic  properties  of  the  MMSE  receiver  were  derived.  It  was 
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shown  that  the  MMSE  receiver  degenerates  to  the  matched  filter,  if  the  multiple-access 
interference  disappear,  and  to  the  decorrelating  solution,  if  the  noise  disappear.  This 
means  that  the  MMSE  receiver  will  be  the  optimum  receiver  in  these  two  extreme 
situations  (recall  that  optimum  receiver  here  is  defined  as  the  linear  single-user  single- 
bit receiver  with  the  smallest  error  probability). 

In  Section  3.6,  complexity  reduction  through  dimension  reduction  was  proposed. 
The  optimum  form  was  derived  and  shown  to  be  dependent  on  the  unknown  A matrix 
(recall  that  A is  defined  by  the  K users’  code  waveforms  and  propagation  delays).  The 
conclusion  is  therefore  that  we  cannot  implement  the  optimum  transform  in  practice. 
However,  several  practical  suboptimum  schemes  (the  gamma,  modified  gamma,  and 
symmetric  dimension  reduction  (SDR)  methods)  and  one  asymptotically  optimum 
scheme  (the  eigenvector  identification  (EGID)  method)  were  proposed.  These  schemes 
were  compared  with  the  cyclically  shifted  filter  bank  (CSFB)  which  was  the  best 
previously  known  method  [25]. 

The  EGID  scheme  is  clearly  the  best  but  also  the  most  complex.  The  SDR  is 
the  simplest  scheme  and  works  slightly  better  than  the  CSFB  method.  The  gamma 
filters  did  not  perform  as  well  as  the  other  schemes,  except  when  the  interference  is 
extremely  large.  Thus,  the  SDR  is  the  method  of  choice  if  complexity  is  a concern. 
We  should  not  use  the  EGID  scheme  if  complexity  is  an  issue.  However,  if  the  EGID 
receiver  is  allowed  to  observe  the  received  signal  for  an  extended  period  of  time,  it  will 
(asymptotically)  have  the  same  performance  as  the  full-length  receiver  in  the  sense 
that  the  attainable  minimum  mean-squared  errors  are  the  same.  Moreover,  since  the 
excess  mean-squared  error  will  be  less  for  the  EGID  scheme  than  for  the  full-length 
receiver,  the  EGID  receiver  might  still  be  of  practical  interest. 

In  Section  3.7,  a modification  to  the  MMSE  receiver,  due  to  Aue  and  Reed  [3], 
was  presented.  This  modification  has  lower  complexity  and  better  performance  than 
the  ordinary  MMSE  receiver.  It  is  believed  that  the  results  from  Sections  3.5  and  3.6 


58 


are  valid  the  modified  receiver  as  well.  This,  however,  is  not  proven  at  the  present 
time  and  is  subject  to  future  work. 


CHAPTER  4 

PARAMETER  ESTIMATION  FOR  THE  AWGN  CHANNEL 


Many  near-far  resistant  receivers  have  been  proposed  in  the  last  decade,  [57,  61, 
22,  56],  to  mention  just  a few.  All  these  schemes  require  exact  knowledge  of  one 
or  several  parameters,  such  as  each  user’s  propagation  delay,  received  power  level, 
and  carrier  phase.  In  a practical  system,  these  parameters  need  to  be  estimated 
and  will  therefore  be  subject  to  estimation  errors.  Relatively  little  has  been  done 
in  terms  of  investigating  the  impact  of  such  errors  on  receivers  that  assume  perfect 
side-information  and  on  how  the  estimation  algorithms  should  be  designed  to  min- 
imize these  adverse  effects.  However,  lately  there  has  been  some  work  done  in  the 
area  of  joint  detection  i.e. , joint  data  detection  and  parameter  estimation.  See,  for 
instance,  litis,  who  uses  an  extended  Kalman  filter  to  obtain  parameter  estimates  in 
a single-user  scenario  [18],  and  Radovic  and  Aazhang,  who  combine  the  expectation 
maximization  (EM)  algorithm  and  the  multistage  detector  in  a near-far  multiuser 
environment  [37]. 

The  standard  method  for  code  acquisition — i.e.,  initial  propagation  delay  estima- 
tion,  typically  before  data  transmission — is  the  sliding  correlator  and  modifications 
thereof  [33,  38].  These  methods  are  essentially  single-user  algorithms;  they  work  rea- 
sonably well  in  a multiuser  environment  if  the  received  powers  are  similar,  but  fail  in 
a near-far  environment  [29,  28].  Other  approaches  to  code  acquisition  are  the  rapid 
acquisition  by  sequential  estimation  (RASE)  algorithm  [59]  and  its  multiuser  exten- 
sion the  multiple- access  RASE  (MARASE)  algorithm  [30].  The  sliding  correlator, 
RASE,  and  MARASE  algorithms  assume  that  a known  bit  sequence  is  transmitted, 
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and  thus  they  cannot  be  used  for  tracking — propagation  delay  estimation  during  data 
transmission. 

The  main  contribution  of  this  chapter  is  the  treatment  of  the  full  estimation  prob- 
lem, i.e.,  estimation  of  the  propagation  delay,  phase,  and  amplitude  for  all  users  in  a 
DS-CDMA  system.  Most  of  the  algorithms  proposed,  however,  deal  with  propagation 
delay  estimation.  The  reason  for  this  is  that  obtaining  accurate  propagation  delay 
estimates  is  a difficult  problem,  and  there  exist  many  algorithms  for  (suboptimally) 
estimating  the  remaining  parameters,  once  reliable  estimates  of  the  propagation  de- 
lays are  available  [34,  60,  9,  45].  No  knowledge  of  the  data  sequences  are  required  by 
the  proposed  algorithms.  This  means  that  the  algorithms  are  applicable  to  both  the 
acquisition  and  tracking  problems. 

The  original  work  of  this  chapter  is  (1)  the  derivation  of  the  Cramer-Rao  bound, 
(2)  the  formulation  of  the  maximum-likelihood,  approximative  maximum-likelihood, 
correlator,  and  MUSIC  estimators,  (3)  the  formulation  of  efficient  search  techniques 
of  the  MUSIC  and  correlator  cost  functions,  and  (4)  the  derivation  of  the  optimum 
dimension  reduction  transform. 

4.1  Cramer-Rao  Bound 

Since  the  derivation  of  the  Cramer-Rao  bound  (CRB)  is  somewhat  lengthy,  we 
defer  the  details  to  Appendix  A. 

The  parameter  estimation  problem  can  be  formulated  as  follows:  given  M obser- 
vations of  the  received  vector,  estimate  the  parameter  vector  0 G IR3^"1"1, 

0 = k2  1T  0T  rT]T  (4.1) 

where  a2,  y,  9 , and  t are  defined  by  (2.4),  (2.21)-(2.23).  We  will,  however,  concen- 
trate our  efforts  on  estimating  the  propagation  delay  vector  r.  In  fact,  we  will  not 
discuss  noise  variance  estimation  at  all.  Nevertheless,  we  will  still  assume  that  the 
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elements  of  4>  are  unknown  and  deterministic  parameters  that  are  fixed  during  the 
interval  for  which  we  observe  the  received  vector. 

The  CRB  is  a lower  bound  on  the  covariance  matrix  of  any  unbiased  estimator  [40]. 
Suppose  (p  is  an  unbiased  estimator  of  a vector  of  deterministic  unknown  parameters 
0 E[<i>\  = 0),  then  the  covariance  matrix  of  the  estimator  is  satisfies1 


E[(0  ~ 0)(0  - 0)T)]  > J 1 = ( E 


id  In  L(r)  \ fd  In  L(r)  \ T 

V d(\>  J \ d<f)  ) 


(4.2) 


where  J G ]r)1+3A')x(1+3^0  js  the  Fisher  information  matrix  and  lnL(r)  is  the  log- 
likelihood  function  of  the  observed  data  r with  respect  to  (f>. 

In  our  case,  the  observed  data  is  the  received  vector  r(m)  for  m = 1,  2, . . . , M; 
hence,  r = [ rr(l)  • • • r T(M)  ]T  e CMQAr.  In  order  to  make  the  problem  tractable, 
we  condition  the  likelihood  function  on  the  transmitted  bits,  or,  equivalently,  on 
z(m).  The  resulting  CRB  may  be  interpreted  as  the  CRB  on  estimators  that  are 
given  knowledge  of  the  transmitted  bits.  Nevertheless,  the  CRB  presented  here  will 
still  be  a valid  (although  maybe  not  a tight)  bound  on  the  performance  of  estimators 
based  on  the  model  (2.17). 


Since  the  noise  is  complex  Gaussian,  the  conditional  log-likelihood  function  (ig- 
noring terms  that  are  independent  of  </>)  is 

1 M 

In L(r)  = -MQNlna2  - - £ ||r(m)  - ABz(m)||2  (4.3) 

° m= 1 

In  order  to  calculate  J,  the  partial  derivative  with  respect  to  the  parameter  vector 
0 is  needed.  As  explained  in  Appendix  A,  it  can  be  seen  from  (2.12)  and  (2.13) 
that  d/ dr/-  A might  not  exist  when  6k  = 0.  Therefore,  in  order  to  proceed,  we  will 
assume  that  6k  0 for  k = 1, . . . , K.  This  rather  mild  restriction  means  that  we  are 
not  sampling  in  perfect  synchronization  with  any  of  the  K users;  furthermore,  the 
algorithms  proposed  below  are  not  dependent  on  this  assumption. 

1With  X > Y for  two  symmetric  positive  semidefinite  matrices  X and  Y,  we  mean  that  X - Y 

is  symmetric  positive  semidefinite. 


62 


It  is  shown  in  Section  A.l  that  the  information  matrix  may  be  written 

MQN/oa  0 0 0 


0 

0 

0 


J *y0  J -yr 

J7 0 J eo  J Ot 

Tr  TT  T 

“77  “77 


where  the  matrices  J77,  J7fl,  J7T,  J00,  J0r,  JTT  g IR*X*  are  defined  as 

2 M 


'77  - a2 


E R-e  {z*(m)B*A’AB7Z(m)} 

71=1 

M 

J70  = 4 E Re  {z*(ra)B*  A*AB0Z(m)} 

^ m=l 
2 M 

V = — E Re{z*(m)B;A**BZ(m)} 


m=l 
2 M 
a 


Jee  = -2  E Re{Z*(m)B;A*AB0Z(m)} 


m=l 
2 M 


J07  = -2  E Re  {Z*(m)BgA*\ErBZ(m)} 


m=l 

M 


Jtt  = ^2  E Re  {Z*(m)B*^*^BZ(m)} 


m=l 


where  B0  = jB, 

B7  = diag(expO^),  exp(j^), . . . , exp(j0K),exp(j0K))  e (C2Kx2K 

and  where  Z(m)  G (-1,0,  l}2KxK  is  defined  as 

' z2k-i(m)  if  i = 2k  — 1 

(Z(m))(i,  k)  = < 


and  ^ = [■?/>!  t/>2 


Z2k(m)  if  i = 2k 

0 otherwise 

1^2 k]  £ ]RQNx2K  is  defined  as 


dsi  1 

= jrPfe  + U)-  D(pt,  l)]c„ 


'I’lk 


Sa2ib  1 


•>Tt  Ti 


= ~ [D(p*+  1,-1)  - D(p*,-l)]ct 


(4.4) 

(4.5) 

(4.6) 

(4.7) 

(4.8) 

(4.9) 

(4.10) 

(4.11) 

(4.12) 

(4.13) 


(4.14) 
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It  is  also  shown  in  Section  A.l  that  the  asymptotic  Fisher  information  matrix, 


*1  oo : is 


loo  - lim  J = 

M— ►oo 


MQN/a 4 0 

® ^77,00 

0 0 J 


0 


Tr 

7r,oo 


0 

0 

99,  oo 

0 


0 

^7T,00 

0 


(4.15) 


where  J-yyi0o>  J-yT>00,  Jeo,ooi  Jrr,o o £ IRAxA  are 


M 

J77,oo  = ^G*diag(A*A)G 

(4.16) 

J7t,oo  = ^Gdiag(A*^)Gv/P 
a £ 

(4.17) 

M 

•Voo  = ^Gdiag(A*A)GP 

(4.18) 

M 

Jtt.oo  = ^Gdiag(tf**)GP 

(4.19) 

where  G G {0, \}2KxK  and  P G IR KxK  are  defined  as 


'1  if  i = 2k  - 1 
G(i,k)  = ll  if  * = 2* 

■ 0 otherwise 
P = diag(Pl5  P2,. . . , Pk) 


(4.20) 


(4.21) 


It  is  possible  to  invert  the  asymptotic  information  matrix  analytically,  since  all 
blocks  are  diagonal.  Furthermore,  the  structure  of  Joo  is  such  that  the  bound  on  the 
variance  of  an  unbiased  estimator  of  77,  6\,  or  t\  is  independent  of  the  parameters 
for  any  other  user.  In  particular,  the  bound  on  the  variance  of  a ri-estimator  is  in- 
dependent of  Pk  for  k = 2,3,...,  K.  This  is  important  since  it  gives  us  hope  that 
there  indeed  may  exist  estimators  that  are  near-far  resistant.  Practical  advantages 
with  the  asymptotic  expression  (4.15)  are  (1)  it  is  independent  of  the  realization  of 
z(m)  and  (2)  it  is  (much)  less  expensive  to  compute  than  the  finite  sample  expres- 
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sion  (4.4).  Furthermore,  as  indicated  in  Section  4.2.5,  the  finite  sample  information 
matrix  approaches  the  asymptotic  expression  for  rather  modest  values  of  M. 

4.2  Estimation  Algorithms 

4.2.1  Maximum-Likelihood 

The  maximum-likelihood  (ML)  estimator  is  found  by  maximizing  the  log-likelihood 
function  (4.3)  with  respect  to  tf>.  As  seen  from  (4.3),  the  optimization  problem  is 
separable  with  respect  to  cr2.  In  particular,  we  can  find  the  ML  estimates  of  7,  0 , 
and  r independently  of  a2. 

Maximizing  the  log-likelihood  function  is  equivalent  to  minimizing  the  negative 
log-likelihood  function  given  by  (constants  unaffected  by  the  optimization  procedure 
are  ignored) 

M 

£(7,  0,  t)  = ||r(m)  - ABz(ra)||2  (4.22) 

m=l 

Note  that  the  log-likelihood  function  is  conditioned  on  z (m)  for  m = 1,2, 

Thus,  minimizing  (4.22)  with  respect  to  0 requires  knowledge  of  the  vector  z E 
{ — 1,0,  lj2MK  defined  as 

z = [zT(l)  zt(2)  •••  zt(M)]t  (4.23) 

Clearly,  z is  unknown,  and  we  are  forced  to  detect  z from  the  observed  vectors.  Thus, 
the  maximum  likelihood  estimates  7ML,  0ML,  rML,  and  zML  are  found  as 

M 

= argmin  £ ||r(m)  - ABz(m)||2  (4.24) 

e m=  1 

T 
Z 

Note  that  the  maximum-likelihood  algorithm  estimates  y,  t,  01  and  z simultaneously. 
This  estimator  exploits  the  full  structure  of  the  problem  and  will  achieve  the  CRB  [40]. 


0ML 

-ML 

T 

ZML 
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The  optimization  of  the  log-likelihood  function  over  z as  well  as  over  0 is  un- 
fortunate, since  we  are  facing  an  optimization  problem  with  both  continuous  and 
discrete  parameters.  The  standard  algorithms  for  optimization,  e.g.,  Gauss-Newton 
or  Levenberg-Marquardt,  are  not  designed  to  work  with  discrete  parameters.  Never- 
theless, these  algorithms  can  still  be  used  to  find  the  maximum-likelihood  estimate 
of  0 as  follows.  Substitute  a realization  of  z into  the  log-likelihood  function,  mini- 
mize the  log-likelihood  function  with  respect  to  the  continuous  parameters,  save  the 
parameters  and  the  value  of  the  cost  function,  and  repeat  this  procedure  until  all 
possible  realization  of  z have  been  tested.  The  ML  estimates  are  then  found  as  the 
parameters  corresponding  to  the  smallest  value  of  the  cost  function.  The  problem  is 
thus  solvable  in  principle,  but  the  complexity  becomes  overwhelming  as  the  number 
of  users  or  the  observation  interval  grows. 

4.2.2  Approximative  Maximum  Likelihood 

The  ML  estimator  is  impractical  since  we  need  to  optimize  a cost  function  with 
respect  to  continuous  as  well  as  discrete  parameters.  However,  we  can  make  an 
approximation  that  in  essence  allows  us  to  “forget”  that  we  have  a mixed  optimization 
problem.  If  we  define 

s(m)  = Bz(m)  € (4.25) 

then  the  received  vector  may  be  written  as 

r(m)  = As  (m)  + n(m)  (4.26) 

If  we  assume  that  s (to)  consists  of  continuous,  unknown  and  deterministic  parameters, 
then  the  ML-estimate  of  s(m)  is  A^r(m),  where  the  matrix  A*  is  the  left  pseudoinverse 
of  A = A(r),  [13,  52],  Since  A has  full  rank,  we  can  compute  A*  as 


Af  = (A*A)_1A* 


(4.27) 
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The  approximative  maximum-likelihood  (AML)  estimate  of  r is  formulated  as 


~ AML 

T - 


M 


= arg min  ^ ||r(m)  - ABz(ra)||: 

m= 1 

= arg  mm  trace  (p^(t)Rm) 


Bz(m)=At  (r)r(m) 


(4.28) 


where  PA(-T)  = IqN  — A(r)A^(r)  is  the  orthogonal  projection  matrix  onto  the  sub- 
space [range(A(r))]J-,  and  R^  is  the  sample  correlation  matrix 

2 M 

*1m=M  12  r(m)r*(m)  (4.29) 

m=l 

Note  that  in  this  procedure,  we  are  not  using  the  full  structure  of  s(m)  indicated 
by  (4.25).  We  therefore  expect  (4.28)  to  yield  worse  estimates  than  (4.24).  Also,  note 
that  the  AML  algorithm  estimates  all  the  delays  simultaneously. 


4.2.3  MUSIC 


The  foundation  to  the  multiple  signal  classification  (MUSIC)  algorithm  [41],  orig- 
inally formulated  for  array  processing,  is  the  observation  that  the  columns  of  A are 
contained  in  the  signal  subspace,  range(Es),  or,  equivalently,  are  orthogonal  to  all 
vectors  in  the  noise  subspace  range(E„). 

Consider  the  vectors  bfc>i(r)  and  bfci2(r), 


bM(r) 


bfe,  2(r) 


Ti 


D(p+  1,1)  + 


D(p,l) 


c k 


5_ 

T 

-1  l 


D(p+  1,-1)  + 


D(p,-1) 


c k 


(4.30) 

(4.31) 


where  r — pTi  + 8 is  such  that  p is  an  integer  and  6 = [0,  Tfi.  We  observe  from  (2.12) 
and  (2.13)  that  a2*;_i  = b^^r*.)  and  a2*,  = b^^T*).  Thus,  given  knowledge  of  R we 
can  find  rfc  as  the  solution  to  ||E;bM(r)||2  = 0 or  as  the  solution  to  ||E;bfci2('r)||2  = 0. 
Moreover,  the  solution  is  unique  under  the  assumptions  that  2 K < QN  and  that  A 
has  full  rank  for  all  possible  realizations  of  r.  In  practice,  the  correlation  matrix  is 
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unknown  and  is  therefore  estimated  by  the  sample  correlation  matrix.  A consistent 
estimate  of  E„  is  found  in  the  eigenvalue  decomposition  of  RM, 

Rm  = Es  ASE*  + E„AnE*  (4.32) 


where  the  columns  of  En  are  the  eigenvectors  corresponding  to  the  QN-2K  smallest 
eigenvalues  of  R^.  Note,  however,  that  the  columns  of  A will  now  be  only  approxi- 
mately orthogonal  to  the  columns  of  En. 

The  original  MUSIC  algorithm  is  formulated  for  the  case  where  each  column  in  A 
is  a function  of  a distinct  parameter.  However,  that  is  not  the  case  here.  One  possible 
modification  of  the  MUSIC  algorithm  is  as  follows.  Let  the  MUSIC  estimate  of  rk  be 
defined  as 


— arS  min  «4,mu(t) 

re[0,T) 

where  Jk,Mu{T)  *s  the  MUSIC  cost  function  for  the  kth  user  defined  as 

jkMV(r)  = + l^bfc’2(r)ll2 


(4.33) 


IVi(T) 


llb*,2(T)||5 


(4.34) 


As  seen  from  (4.30)  and  (4.31),  bM(r)  and  b ka(r)  are  piecewise  linear  in  r.  In 
particular,  for  r G \pTu  (p  + 1)7*),  referred  to  as  the  pth  bin  of  r, 


bM(r)  = ^b M([p  + l]Tj)  + ^ b kA(pTi)  (4.35) 

b *,2(r)  = ^bM([p  + l]Tj)  + hhfl(pTi)  (4.36) 

By  substituting  (4.35)  and  (4.36)  into  (4.34)  and  noting  that  if  p is  an  integer  then 
I|bfc,i(p7i)||2  = ||bfci2(pTi)||2  = QN,  we  obtain  that  for  r G [pT*,  (p+  1)T,), 

T _ s2Ku  + (1  — s)2Ki2  + 2s(l  — s)A13 

^ > (S2  + (!  _ s)2)QN  + 2s(l  _ s)Ku 

s2  K21  + (1  — s)2  K22  + 2s(l  — s)A2  3 
(s2  + (1  - s)2)QN  + 2s(l  - s)K24 


(4.37) 
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where  we  have  defined  s = 5/Ti  and 


Ku  = ||E;bM([p+l]Ti)||2  (4.38) 

Kn  = ||E£bjb,1(Pr*)l|2  (4.39) 

Kl3  = R e{bj>1([p  + l]Ti)E„E  *nbk,i(pTi)}  (4.40) 

Ku  = bfctl([p  + l]Ti)hk>i{pTi)  (4.41) 


and  where  K2 i,  K22,  K23,  and  K2i  are  defined  in  an  analogous  manner  by  changing 
bfc,i  to  bfe2  in  (4.38)-(4.41). 

From  (4.37),  we  see  that  Jk< muC?-)  can  be  written  as  a rational  function  of  two  poly- 
nomials of  degree  four,  and  furthermore  that  Jfc,Mu(r)  is  differentiable  for  r G [0,T), 
except  when  r = pT)  for  an  integer  p.  This  suggests  an  efficient  search  procedure  of 
the  cost  function.  We  form  the  set  T of  candidate  MUSIC  estimates  as  the  union  of 
the  set  {0,  Tj,  27j, . . . , ( QN  - 1)7)}  and  the  set  of  real  solutions  to  the  equation 


dJk,  mu(t~) 
dr 


(4.42) 


The  solutions  to  (4.42),  i.e.,  the  roots  to  the  numerator  polynomial  of  (4.42),  can  be 
found  efficiently  by  numerical  methods.  Finally,  ffcMU  is  found  as  the  member  of  T 
corresponding  to  the  smallest  value  of  the  cost  function. 

Note  that  the  MUSIC  algorithm  finds  one  element  of  r at  a time. 


4.2.4  Sliding  Correlator 

The  standard  approach  for  propagation  delay  estimation  is  the  so-called  sliding 
correlator  [33].  Much  as  the  standard  receiver  (i.e.,  the  matched  filter),  the  sliding 
correlator  is  only  optimum  in  a single-user  situation  or  when  the  received  code  wave- 
forms are  orthogonal.  It  is  therefore  expected  that  this  scheme  will  fail  in  a near-far 
situation.  However,  for  the  purpose  of  comparison,  we  will  now  describe  an  algorithm 
that  is  analogous  to  the  sliding  correlator. 
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If  we  constrain  dk(m)  = 1 for  m = 1, . . . , M,  we  see  from  (2.8)  that  the  contribu- 
tion from  the  kth.  user  to  the  received  vector  is  /3ka2k-i.  Thus,  since  a2fc_i  = b^^r*) 
is  roughly  orthogonal  to  {a 2j_u  a2j}  for  j ± k,  and  since  {a2j_l5  a2j}  are  independent 
of  rk  for  j ^ k,  we  can  form  the  correlator  estimate  of  rk  as 


rk  = arg  min  Jk  Jr) 
k r*e[o,r)  *’cV  ’ 


(4.43) 


where  JktC(r ) is  the  sliding  correlator  cost  function  for  the  A;th  user 


Jk,c(r)  — 


l|bw(r)|| 


lbM(r)rMp 

Ilbfc,i('r)||2 


(4.44) 


and  where  rM  = As  seen  from  this  equation,  the  correlator,  as  the 

MUSIC  algorithm,  estimates  one  propagation  delay  at  a time. 

We  now  form  two  equations  analogous  to  (4.37)  and  (4.42).  For  r 6 \pTu  (p+l)Tj), 
Jk,c(T)  can  be  written  as 


r.  «2|Ci-C0|2  + 2SRe{(C1-C'o)Q}  + |Co|2 

(s2  + (1  — s)2)QN  + 2s(l  — s)bkl(\p  + \}Ti)ak'i{pTi)  (4‘45j 

where  C\  — x([p  + l]Tj)rM  and  Co  = b*k  l(pTi)rM-  After  differentiating  (4.45) 
with  respect  to  r,  we  note  the  resulting  numerator  is  of  degree  two.  The  solution 
to  dJk^(r)/dr  — 0 can  therefore  be  found  analytically,  and  the  search  for  a global 
minimum  can  be  carried  out  in  the  same  manner  as  in  Section  4.2.3. 

Note  that  the  MUSIC,  ML,  and  AML  algorithms  do  not  assume  that  dk(m)  — 1. 
Therefore,  in  the  simulations  reported  in  Section  4.2.5,  the  correlator  is  given  some 
extra  side-information,  and  the  comparisons  with  the  other  algorithms  are  actually 
biased  in  favor  of  the  correlator. 


4.2.5  Numerical  Results 

In  this  section,  the  performance  of  the  AML,  MUSIC,  and  correlator  algorithms 
is  evaluated  by  Monte-Carlo  simulations. 


70 


The  simulated  system  was  a 10-user  system  with  N = 31  chips  per  bit  Gold 
sequences,  generated  by  the  polynomials  gx (x)  = x5  + x + 1 and  g2(x ) = x5  + x4  + 
x3  + x2  + 1,  [39].  Oversampling  was  not  used,  Q = 1,  and  the  signal-to-noise  ratio  of 
the  first  user  was  fixed  at  SNRj  = 2Ebti/No  = 15  dB.  The  near-far  ratio  is  defined 
as  P2/Pi,  where  all  interfering  users  have  the  same  received  power,  Pk  = P2  for 
k = 3,4,...,  10. 

For  each  Monte-Carlo  run,  independent  realizations  of  the  noise  and  data  se- 
quences were  created;  however,  the  propagation  delays,  received  amplitudes,  and 
phases  were  fixed.  The  measure  of  performance  for  an  estimator  is  the  sample  stan- 
dard deviation,  std(fi), 


where  R = 500  is  the  number  of  Monte-Carlo  runs  that  were  performed  for  each 
simulation.  The  estimators  turned  out  to  be  approximately  unbiased,  and  hence  the 
sample  standard  deviation  is  an  appropriate  performance  measure. 

The  absolute  performance  of  the  propagation  delay  estimators  is  of  course  depen- 
dent on  the  particular  t for  which  the  simulations  were  done;  however,  the  relative 
performance  of  the  estimators  proved  to  be  roughly  independent  of  the  realization  of 
the  delays. 

The  Cramer- Rao  bound  serves  as  a lower  bound  on  the  standard  deviation.  The 
asymptotic  CRB,  given  by  (4.15),  was  used  since  the  ordinary  CRB  is  dependent  on 
the  data  sequences,  which  were  changed  between  Monte-Carlo  runs.  Moreover,  the 
actual  CRB  was  very  close  to  the  asymptotic  expression  if  the  observation  interval 
M was  greater  than  20. 

The  optimization  of  the  MUSIC  and  sliding  correlator  cost  functions  was  carried 
out  as  described  in  Sections  4.2.3  and  4.2.4.  This  guarantees  that  the  global  minimum 
is  found.  The  cost  functions  are  highly  nonlinear,  and  sometimes  the  estimates  found 


(4.46) 
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Figure  4.1.  Standard  deviation  of  the  AML:  (x),  the  MUSIC:  (o),  and  the  correlator 
estimators:  (+).  The  solid  line  is  the  Cramer- Rao  Bound. 

were  more  than  one  chip  duration  away  from  the  true  delay.  Such  estimates  are  called 
outliers.  The  error  due  to  an  outlier  estimate  may  be  very  large.  Therefore,  to  make 
the  comparison  with  the  CRB  meaningful,  the  outlier  errors  were  removed  before  the 
standard  deviation  was  computed 

An  outlier  error  is  of  course  disastrous  in  the  acquisition  phase.  However,  in  the 
tracking  phase,  we  will  probably  limit  the  search  of  the  cost  function  to  be  in  the 
vicinity  (say,  plus  minus  TJ 2)  of  the  previous  estimate.  This  is  reasonable  since  the 
propagation  delays  are  typically  slowly  varying.  By  limiting  the  search,  the  algorithm 
will  not  find  an  outlier  estimate. 

The  MUSIC  estimator  produced  very  few  outliers  when  more  than  M = 100  vector 
samples  were  observed.  For  M = 50  and  near-far  ratio  20  dB,  the  relative  number  of 
outliers  increased  to  about  20%.  On  the  other  hand,  the  sliding  correlator  had  a high 
number  of  outliers  in  a near-far  situation  even  if  the  observation  interval  was  made 
long.  When  the  near-far  ratio  was  20  dB,  the  outlier  ratio  was  50%  for  M = 100  and 
13%  for  M = 400. 

In  case  of  the  AML  algorithm,  the  optimization  was  carried  out  by  a numerical 
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Figure  4.2:  Standard  deviation  of  the  AML:  (x),  the  MUSIC:  (o),  and  the  correlator 
estimators:  (+ ).  The  solid  line  is  the  Cramer-Rao  Bound,  and  the  number  of  observed 
vector  samples  is  M = 100. 

search  started  in  the  true  point.  The  cost  function  is  such  that  a local  minimum  is 
often  located  very  close  to  the  true  point.  Therefore,  there  is  a lower  risk  of  finding 
an  outlier  error,  and  the  performance  of  the  AML  algorithm  shown  here  is  therefore 
slightly  optimistic. 

In  Figure  4.1,  the  standard  deviations  of  the  AML,  MUSIC,  and  correlation  al- 
gorithms are  plotted  versus  the  number  of  observed  vectors,  and  in  Figure  4.2,  the 
standard  deviations  are  plotted  as  functions  of  the  near-far  ratio.  Note  that  the  MU- 
SIC and  AML  algorithms  performed  similarly  and  were  unaffected  by  the  near-far 
problem.  However,  the  correlator  algorithm  was  very  sensitive  to  interference.  Fur- 
thermore, none  of  the  estimators  attained  the  Cramer-Rao  bound.  Thus,  the  ML 
estimator  would  have  had  the  best  performance,  but  also  largest  complexity. 

The  results  presented  here  were  obtained  with  SNRt  = 15  dB.  The  MUSIC  es- 
timator seems  to  be  more  noise  sensitive  than  the  correlator  and  AML  algorithms. 


Variance  of  t\  estimate 


5 10  15  20 

Near-Far  Ratio  [dB] 
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With  good  power  control  and  where  the  SNR  is  low,  the  correlator  works  as  good  as 
or  better  than  the  MUSIC  and  AML  estimators.  When  SNRj  is  low  and  the  near-far 
ratio  is  high,  the  AML  is  probably  the  method  of  choice.  However,  when  SNRi  is 
reasonably  high  and  the  interference  significant,  the  MUSIC  algorithm  is  attractive 
since  it  has  a much  lower  complexity  than  the  AML  method. 

4.3  Dimension  Reduction 

Many  good  estimators  have  a computational  complexity  on  the  order  of  ( QN )3, 
where  QN  is  the  dimensionality  of  the  observed  data.  One  possibility  to  lower  the 
complexity  of  the  algorithms  is  to  reduce  the  dimensionality  of  the  received  vector 
by  premultiplying  with  V*,  where  V 6 C QNxL.  Obviously,  we  would  like  to  choose 
V such  that  it  is  optimum  in  some  sense.  If  we  restrict  our  attention  to  unbiased 
estimators,  the  natural  choice  of  performance  measure  is  the  Cramer-Rao  bound. 

4.3.1  Optimum  Dimension  Reduction 

Since  we  are  mostly  interested  in  estimating  7,  9 , and  r,  we  define  the  optimum 
transform  V G &QNxL  to  be  such  that  the  CRB  for  these  parameters  is  conserved. 
That  is,  V is  an  optimum  transform  if  the  CRB  for  an  estimator  based  on  the  re- 
ceived vectors  (r(l), . . . , r(M)}  is  the  same  as  the  CRB  for  an  estimator  based  on 
{V*r(l), . . . , V*r(M)}.  Note  that,  by  definition,  no  V exists  such  that  the  CRB  is 
reduced.  The  idea  of  dimension  reduction  is  not  new,  and  the  work  presented  in  this 
section  is  inspired  in  many  respects  by  Anderson  [2], 

The  reduced  dimension  received  vector,  rv(m)  e CL,  can  be  written  as 

r v(m)  = V*r(m)  = V*ABz(m)  + V*n(m)  = AvBz(m)  + n v(m)  (4.47) 
where  we  have  defined 


Ay  = V*A  = [a^i  •••  av,2*]eCLx2* 


(4.48) 
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nv(m)  = V*n(m)  (4.49) 

We  now  make  the  assumption  that  V*V  = 1^.  Since  n(m)  is  zero-mean  complex 
Gaussian  and  has  second  moment  matrices  as  given  by  (2.7),  nv(m)  = V*n(m)  will 
also  be  zero-mean  complex  Gaussian  with  second  moment  matrices 


E[ny(m)<(A;)]  = V*  E[n(m)n*(fc)]V  = V*(a2IN5K(k  - m))V 

= a2ILSK(k-m)  (4.50) 

E[nv(m)n£(fc)]  = 0 (4.51) 


Thus,  the  log-likelihood  function  (ignoring  constants  independent  of  <J>)  is 


j M 

lnL(iv)  = -ML\na2  - — £ ||w(m)  - AyBz(m)||2  (4.52) 

° m=  1 

Moreover,  if  we  assume  that  the  columns  of  Av  are  linearly  independent  for  all 
possible  realizations  of  r and  exclude  the  case  when  5k  = 0 for  any  k - 1, 2, . . . , K, 
then  the  Fisher  information  matrix  for  an  unbiased  estimator  based  on  rv  will  be 


as  in  (4.4)  with  the  appropriate  substitutions,  A t—  Av,  'T  t—  \I>y  = V*^,  and 


QN  <r~  L, 

Jv  = 

where 


ML/cr4 

0 

0 

0 ' 

0 

Jv,77 

Jv,70 

Jv,7  t 

0 

Tr 

jvvy0 

Jv,00 

J V,0T 

0 

JV,JT 

1T 

JV,0T 

Jv,tt  . 

(4.53) 


2 il 


V,77  = -2  £ Re  {z*(m)B*  A^Av/B7Z(m)} 

® m= 1 

(4.54) 

2 m 

vao  = ^£  Re  {z*(m)B*  AyAv/B0Z(m)} 

^ m=l  J 

(4.55) 

2 M 

E7t  = -j  £ Re {z*(m)B*  Ay'JryBZ(m)} 

^ m=l 

(4.56) 
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2 M 

J v,eo  = — Re{Z*(m)B0A^Av/B0Z(m)}  (4.57) 

° m=  1 
2 ^ 

J^r  = -j  £ Re{Z*(m)B;A*u^vBZ(m)}  (4.58) 

a m=l 
2 ^ 

Jv,tt  = ^E  Re{Z*(m)B*tf^BZ(m)}  (4.59) 

° m=  1 

The  form  of  the  optimum  dimension  reduction  transform  is  given  in  the  following 
theorem. 

Theorem  4.1  If  V e CQWxL  is  such  that  V*V  = IL  and 

range  ([  A ^ ])  C range(V)  (4.60) 

then  the  CRB  for  an  estimator  of  the  parameter  vector  (f>  = [ 0T  tt  ]t  is  the 
same  for  an  estimator  based  on  the  observed  data  {r(l), . . . , r(A/)}  as  for  an  estimator 
based  on  {V*r(l), . . . , V*r(M)}. 

Proof:  The  CRB  with  respect  to  is  given  as  the  inverse  of  the  lower  right- 
hand  3 K x 3 K block  of  the  Fisher  information  matrix  (4.53).  To  prove  the  theo- 
rem, it  is  therefore  sufficient  to  show  that  this  block  of  the  information  matrix  is 
the  same  for  an  estimator  based  on  (r(l), . . . , r(M)}  as  for  an  estimator  based  on 
{V*r(l), . . . , V*r(M)}. 

Since  the  columns  of  V form  an  orthonormal  basis  for  W = range(V)  C (D^,  the 
orthogonal  projection  matrix  Pvv  onto  W is  found  as  P\y  = VV*.  By  assumption, 
range([A  \Er])  C W and  therefore 

VV*A  = a (4.61) 

VV*  fit  = ^ (4.62) 

This  implies  that  A^Ay  = A*VV*A  = A*A  and,  in  the  same  manner,  that 
= ^r*^r  and  A*v^v  = A*\F.  It  is  now  seen  that  the  blocks  of  3V,  (4.54)- 
(4.59),  and  J,  (4.5)-(4.10),  are  equivalent,  and  the  theorem  follows.  ■ 
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If  we  assume  that  [A  'T  ] has  full  rank,  the  theorem  says  that  we  can  lower  the 
dimension  of  the  received  vectors  from  QN  to  4 K without  loss  of  performance  in  the 
CRB  sense. 

It  is  easy  to  show  that,  as  M -4  oo,  the  MUSIC  estimates  of  rk  based  on 
{r(l), . . . , r(M)}  and  {ry(l)., . . . , ry(M)},  will  be  identical  and  equal  to  rk.  This, 
however,  is  only  a statement  of  consistency  and  does  not  imply  that  the  estimator 
variance  will  decrease  with  M at  the  same  speed  in  both  cases.  Nevertheless,  in  the 
next  section,  we  will  demonstrate  experimentally  that  the  optimum  transform  indeed 
has  this  desirable  property. 

4.3.2  Numerical  Results 

The  optimum  transform  conserves  the  CRB  which  is  a lower  bound  on  the  variance 
of  an  unbiased  estimator.  Thus,  there  is  no  guarantee  that  a suboptimum  method  will 
perform  equally  well  before  and  after  dimension  reduction.  However,  in  this  section, 
we  will  verify  this  result  for  the  MUSIC  estimator  by  simulations. 

The  simulated  system  was  a 2-user  system  with  Q = 1,  N = 15  chips  per  bit,  Tc  = 
1,  SNRj  = 2Eb^/N0  = 15  dB,  and  P2  = Pi  (i.e. , perfect  power  control).  The  code 
sequences  are  two  maximum  length  sequences  generated  by  the  polynomials  gx  (x)  = 
x4  + x + 1 and  <72(2)  = + x3  + 1,  [39].  The  propagation  delay,  T\,  was  estimated  by 

the  MUSIC  estimator  based  on  both  {r(l), . . . , r (M)}  and  {rv(l), . . . , rv(M)},  where 
V is  an  optimum  transform,  M G {50,100,200,300,500},  and  L = 4K  = 8.  This 
L represents  the  largest  dimension  reduction  that  can  be  achieved  by  any  optimum 
transform. 

As  seen  from  Figure  4.3,  no  performance  degradation  of  the  timing  estimator  was 
noticed  even  though  the  dimension  of  the  received  vector  was  reduced  by  almost  a 
factor  of  two. 

Recall,  however,  that  the  optimum  transform  requires  knowledge  of  the  propaga- 
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Figure  4.3:  CRB:  solid  line,  MUSIC  estimate  before  dimension  reduction:  (x)  and 
MUSIC  estimate  after  dimension  reduction:  (+). 

tion  delays.  In  other  words,  we  cannot  do  optimum  dimension  reduction  before  we 
know  what  we  want  to  estimate.  Nevertheless,  we  can  still  envision  a recursive  scheme 
in  which  we  estimate  the  propagation  delays,  compute  the  optimum  transform  based 
on  the  estimates,  and  then  update  the  estimates  based  the  new  transform.  This  might 
still  give  us  a lower  total  complexity,  since  many  estimators  typically  require  on  the 
order  of  ( QN )3  operations  which  can  be  lowered  to  on  the  order  of  Z,3  operations  per 
iteration. 


4.4  Conclusions 

This  chapter  considered  the  parameter  estimation  problem  for  DS-CDMA  systems 
operating  over  AWGN  channels. 

The  Cramer- Rao  bound  (CRB)  was  presented  in  Section  4.1.  The  CRB  is  of 
great  importance  since  it  is  a bound  on  the  best  accuracy  with  which  parameters 
can  be  estimated  by  any  unbiased  estimator.  In  other  words,  an  unbiased  estimator 
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whose  variance  attains  the  CRB  is  optimum.  The  asymptotic  CRB  (i.e.,  the  CRB 
as  the  observation  interval  goes  to  infinity)  was  also  derived.  By  studying  the  form 
of  asymptotic  CRB,  we  concluded  that  there  may  indeed  exist  near-far  resistant 
estimators. 

In  Section  4.2.1,  the  optimum  maximum-likelihood  (ML)  estimator  was  defined. 
Unfortunately,  the  ML  estimator  is  very  complex  and  thus  of  limited  practical  value. 
To  overcome  this  deficiency,  two  suboptimum  estimators  were  proposed,  the  approx- 
imative maximum-likelihood  (AML)  algorithm  and  the  MUSIC  algorithm  in  Sec- 
tion 4.2.2  and  4.2.3,  respectively.  Of  these  schemes,  the  MUSIC  algorithm  has  the 
lower  complexity.  In  Section  4.2.5,  the  performance  of  these  estimators  was  compared 
with  the  CRB  and  the  performance  of  the  correlator  algorithm.  The  correlator  al- 
gorithm, which  is  analogous  to  the  standard  sliding  correlator,  was  shown  to  work 
satisfactory  when  the  multiuser  interference  is  low.  However,  in  a near-far  situation, 
the  performance  is  severely  degraded.  The  AML  and  MUSIC  algorithms  were  shown 
to  have  similar  performance  and  to  be  robust  against  multiuser  interference.  Further- 
more, they  impose  no  constraints  on  the  transmitted  data  sequences — in  contrast  to 
the  sliding  correlator — and  can  therefore  be  used  for  tracking  as  well  as  for  acquisition. 

Finally,  in  Section  4.3,  complexity  reduction  through  dimension  reduction  was 
proposed.  The  optimum  transform,  i.e.,  the  transform  that  conserves  the  CRB,  was 
derived.  The  optimum  transform  requires  knowledge  about  all  users’  code  waveforms 
and  propagation  delays.  Thus,  since  the  propagation  delays  are  unknown,  the  opti- 
mum transform  is  unknown.  However,  as  mentioned  in  Section  4.3.2,  we  can  still  use 
the  optimum  transform  in  a practical  manner. 


CHAPTER  5 

SENSITIVITY  ANALYSIS  OF  THE  DECORRELATING  RECEIVER 


In  this  chapter  we  will  present  a sensitivity  analysis  and  simulation  of  the  pop- 
ular decorrelator  detector  with  respect  to  propagation  delay  estimation  errors.  The 
notation  used  in  this  chapter  is  different  from  the  notation  in  the  rest  of  this  disser- 
tation. The  reason  for  this  is  to  be  consistent  with  the  notation  that  is  widely  used 
in  the  literature.  This  chapter  is  self-contained,  and  author  hopes  that  the  notational 
change  will  not  be  too  confusing. 

The  original  work  in  this  chapter  is  (1)  the  extension  of  the  Lupas-Verdu  system 
model  to  account  for  propagation  delay  estimation  errors,  (2)  the  proof  of  the  exis- 
tence of  a near-far  resistant  detector  in  the  presence  of  imperfect  propagation  delay 
estimation,  and  (3)  the  experimental  results  reported  in  Section  5.5. 

5.1  Introduction 

The  decorrelating  receiver  of  Lupas  and  Verdu  [24,  22]  as  well  as  several  interfer- 
ence cancellation  schemes  [11]  have  gained  considerable  interest  lately.  The  reasons 
for  this  are  that  the  standard  matched  filter  receiver  will  not  function  satisfactory  in 
a near-far  environment,  and  that  the  complexity  of  the  optimum  receiver  renders  it 
unfeasible  [57]. 

Near-far  resistant  structures  are  typically  dependent  on  accurate  side-information, 
e.g.,  the  number  of  users,  each  user’s  code  waveform,  received  power,  carrier  phase, 
and  propagation  delay.  The  number  of  users  and  their  code  waveforms  might  be  read- 
ily available,  but  the  other  parameters  must  be  estimated.  It  is  therefore  of  interest 
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to  evaluate  the  performance  of  the  receivers  when  the  available  side-information  is 
contaminated  with  estimation  errors. 

In  this  chapter  we  will  study  the  decorrelating  receiver.  In  its  original  form,  the 
decorrelator  requires  knowledge  of  the  number  of  users,  each  user’s  code  waveform, 
received  carrier  phase,  and  propagation  delay.  For  reasons  explained  below,  we  will 
assume  that  the  phase  is  estimated  perfectly.  We  will,  however,  allow  the  propagation 
delay  estimates  to  contain  estimation  errors,  and  derive  expressions  for  the  error 
probability,  asymptotic  efficiency,  and  near-far  resistance. 

5.2  System  Model 

The  objective  with  this  section  is  to  extend  the  system  model  of  Lupas  and  Verdu 
to  account  for  propagation  delay  estimation  errors  [22,  57], 

The  received  signal  is 

r(t)  = S(t,b) + n(t)  (5.1) 

where  n(t)  is  a white  Gaussian  waveform  with  power  spectral  density  N0/2.  In  general, 
a subscript  k implies  that  the  quantity  is  due  to  the  /cth  user.  For  example,  bk(i)  E 
{-1, 1}  is  the  kth  user’s  zth  bit.  The  signal  part,  S(t,  b),  is  found  as 

M K 

S(t,b)=  E J2b^)\l^k{i)sk(t  - iT  - rk)  (5.2) 

i=—M  k=  1 

where  K is  the  number  of  users,  N — 2M  + 1 is  the  number  of  transmitted  bits 
per  user,  wk(i)  is  the  received  bit  energy,  rk  E [0,T)  is  the  propagation  delay,  and 
sk(t)  = 0 for  t £ [0,  T),  where  T is  the  bit  duration.  Furthermore,  the  code  waveform, 
sk(t),  is  normalized, 

/OO 

s2k(t)dt=  1 (5.3) 

-OO 

The  vector  of  transmitted  bits,  b 6 {-1,1}^,  is  defined  as 

NK 


b = [bT(— M)  •••  bT(M)]T  € {-1, 1} 
b(>)  = [6i(i)  •••  ijcMfet-U}* 


(5.4) 

(5.5) 
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The  receiver  front-end  consists  of  a bank  of  matched  filters.  The  output  of  the 
kth.  user’s  matched  filter  sampled  at  the  ith  bit  interval  is 


It  is  easy  to  show  that  {yk{i)}  for  k = 1, . . . , K and  * = -M, . . . , M is  a sufficient 
statistic  for  detecting  b. 

Note  that  we  need  to  form  sk(t  — rk)  in  order  to  perform  matched  filtering.  This 
means  that  we  need  to  know  the  carrier  phase  [which  is  implicitly  included  in  sk(t)] 
and  rk.  In  practice,  the  phase  and  propagation  delay  are  unknown  and  must  be 
estimated.  We  choose  to  ignore  the  phase  error  since  we  are  mostly  concerned  with 
the  near-far  problem.  This  is  justified  by  the  following  argument.  Suppose  that 
all  users’  received  carrier  phases  are  equal.  Consider  the  signal  after  down-mixing, 
i.e.,  after  have  multiplied  the  received  signal  with  a replica  of  the  carrier  (offset  by 
the  estimated  phase)  and  retained  the  baseband  signal  (the  lowpass  portion  of  the 
resulting  signal).  The  down-mixing  procedure  only  scales  the  users’  contributions  to 
the  received  signal  by  a common  factor.  The  relative  power  levels  between  the  users 
are  therefore  unaffected  by  the  phase  estimation  error,  and  the  interference  situation 
is  unchanged.  However,  the  signal-to-noise  ratio  will  be  affected,  but  since  we  assume 
that  the  system  is  interference  limited,  we  ignore  this  problem. 

Let  the  estimate  of  rk  be  denoted  by  fk.  We  assume  that  the  estimation  errors 
are  such  that  fk  E [0,T).  If  we  assume  a correlator-style  matched  filter,  the  sampled 
matched  filter  output  is 


Vk{i)  = I r(t)sk(t  -iT-  rk)  dt 


(i+l)T+Tk 


(5.6) 


(i+l)T+fk 


(5.7) 


where 


(5.8) 
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is  a zero-mean  Gaussian  random  variable  with  crosscorrelation 

Nn  r°° 

EK(*)«/(i)]  = y L Sk(<t  ~iT~  ffc) si (*  ~ iT  ~ n) dt  (5.9) 

If  we  define 

£ r oo 

Rhj(l)=  8k(t  - Tk)Sj(t  + IT  - Tj)  dt  (5.10) 

J — oo 

„ r oo 

Rkj{l)  = sk(t- Tk)Sj(t  + lT  - Tj)dt  (5.11) 

J—  OO 

then  E[nfc(*)m0')]  = Rki(i~j)N0/2.  Recall  that  sfe(t)  = 0 for  t <£  [0,  T),  which  implies 
that  Rkj(l)  — Rkj( l)  = 0 for  |/|  > 1.  It  is  easy  to  show  that 

M K 

Vk{i)  = nk(i)  + Y,  biti)\/wi{j)Rki(i  ~ j) 
j=-M  1= 1 


- Mz)  + 1 )\Jwi(i  - l)Rfc/(l)  + bi(i)\J wi(i)Rki{ti) 


1=1 

K 

i=i 

+ 53M*  + l)\lwi(i  + l)-Rw(-l) 
1=1 

(5.12) 

where  we  assume  that  bi(—(M  + 1))  = 
Lupas  [22],  we  can  write 

= bi(M  + 1)  = 0.  Following  the  same  steps  as 

y = 

= TZWb  + n 

(5.13) 

where  we  have  defined 

y = [y t(-m)  ■ 

••  yT(M)]TeJRNK 

(5.14) 

y(*)  = [yi(0  ••• 

y/r(i)]T  G JR* 

(5.15) 

n = [nr(—M)  ■ 

••  nT(M)]TeJRNK 

(5.16) 

n(»)  = [»i(0  ••• 

nK(i)  ]T  G IRa 

(5.17) 

W = diag(W(— M), . . . , W (M))  G IR^X^ 

(5.18) 

W(i)  = diag  (y/wi(i), 

(5.19) 
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R(0)  R(— 1)  0 •••  0 

R(l)  R(0)  R(— 1)  ; 

h=  0 R(l)  R(0)  •••  0 (5.20) 

: •••  R(-1) 

0 •••  0 R(l)  R(0)  _ 

where  the  (i,  j)th  element  of  the  matrices  R(Z)  e and  R(Z)  g JR*XJif  are 

(R(/))(i,j)  = Rij(l)  and  = Rij(l),  respectively.  The  noise  vector  n is  a 

Gaussian  zero-mean  vector  with  correlation  matrix 


E|nnT]  = Tre 


(5.21) 


where  matrix  7 Z is  defined  as  7Z  in  (5.20)  with  the  obvious  substitutions  R (/)  g-  R(/). 

5.3  Analysis  of  Linear  Receivers 

A linear  receiver  is  defined  in  terms  of  its  receiver  vector  v G IR^  and  forms  the 
decision  of  the  kth  user’s  ith  bit  as1 

bk(i)  = sgn[vTy]  = sgn[vr7?.Wb  + vrn]  (5.22) 

The  noise  term  vrn  is  seen  to  be  a zero-mean  Gaussian  random  variable  with  variance 

(5.23) 


An 


E[(vTn)2]  = vrE[nnr]v  = -^-wT'R,\ 

Suppose  we  let  bk(i)  = 1,  then  the  probability  of  error  conditioned  on  b is 


Pe  |b  — Q 


^ vTnwh  ^ 


(5.24) 


WfvTKv/ 


and,  assuming  equally  likely  independent  bits,  the  probability  of  error  is 

Pe  ~ pe\b  Pr[b  I bk(i ) = 1] 


1The  receiver  vector  is,  of  course,  allowed  to  change  with  k and  i.  However,  the  dependence  of  v 

on  k and  i is,  for  notational  simplicity,  not  explicitly  indicated. 
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= E 


bef-M}** 

bk(i)=l 


2nk 


xrWb 


^vrnv  ) 


Xr  = vt7 Z = [xt(-M)  •••  xt(M)  ]r 

x(*)  = [®i(*)  •••  ®jf(*)]T 


(5.25) 

(5.26) 

(5.27) 


To  compute  the  probability  of  error  we  need  evaluate  and  sum  2NK~ 1 Q-functions. 
This  clearly  gets  out  of  hand  even  for  rather  small  N and  K . However,  if  we  are 
mostly  concerned  with  the  near-far  problem,  we  can  also  measure  performance  in 
terms  of  the  asymptotic  efficiency  and  the  near-far  resistance  [24],  The  asymptotic 
efficiency  for  the  kth  user’s  ith  bit  is  defined  as 

= ™P  jo  < r < 1 : Um  P.  / Q < ooj  (5.28) 

In  general,  rjkj  will  be  dependent  on  wi(j),  since  Pe  is  a function  of  wi(j).  We  therefore 
define  the  near- far  resistance  of  a receiver,  fjk ,i,  as  the  worst  case  asymptotic  efficiency 


Vk,i=  inf  (5.29) 

Wk(i)  const. 

The  inner  product  xTWb  can  be  written  as 


xTWb  = xk(i)y/wk(i)  + Y xiti)y/Mj)bi(j)  (5.30) 

(ffiV(M) 

As  N0  -4  0,  the  error  probability  will  be  dominated  by  the  Q-function  with  the 
smallest  argument.  The  smallest  argument  squared  is 


2 

N0vTilv 


(i)y/wk(i)  - Y 


2 


2 wk(i)  1 
N°  vr7 Zv 


- E M)\ 


2 


(5.31) 
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Thus,  the  asymptotic  efficiency  is 


vT7lv 


Mj) 

\ wk(i) 


(5.32) 


and,  clearly,  the  near-far  resistance  is 


if  x i(j)  = 0,  for  all  (l,j)  ± (k,i) 


(5.33) 


0 otherwise 


5.4  Analysis  of  the  Decorrelating  Receiver 


The  decorrelating  receiver  for  the  case  of  imperfect  matched  filtering  is  defined  as 
the  receiver  whose  receiver  vector  satisfies  1lTv  = u*,*,  where  uM  e JRNK  is  the  unit 
vector  Ukti(n)  = 6K(n  - [(i  + M)K  + k]).  If  we  assume  that  H is  invertible  (which 
is  reasonable  for  small  estimation  errors),  then  vT  = u^AT1,  and  xT  = xTH  = u^. 
The  asymptotic  efficiency  for  the  decorrelating  receiver,  using  the  notation  [R.~l]T  = 


Since  the  asymptotic  efficiency  is  independent  of  wt(j)  for  (l,j)  ± ( k,i ),  the 
asymptotic  efficiency  is  equivalent  to  the  near-far  resistance.  This  means  that  it  is 
theoretically  possible  to  achieve  near-far  resistance  even  in  the  presence  of  imperfect 
matched  filtering.  However,  note  that  Tl  is  unknown  since  the  propagation  delays  are 
unknown.  In  other  words,  we  do  not  have  the  necessary  information  to  form  v.  One 
possible  approximation  to  v would  be  to  use  ir  = 


1 


1 


vT7Zx 


(5.34) 


5.5  Numerical  Results 


In  the  above  it  was  shown  that  the  decorrelating  receiver  based  on  v is  not  near-far 
resistant.  It  is  therefore  of  interest  to  evaluate  just  how  sensitive  this  receiver  is  to 
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ctt  G {0.01,0.02,0.03,0.04,0.05,0.10} 


Near-far  ratio  [dB] 


Figure  5.1:  Average  asymptotic  efficiency  for  decorrelator  using  ir\  (o)  and  ideal 
decorrelator:  solid  line. 


imperfect  propagation  delay  estimates.  In  this  section  we  will  evaluate  the  probability 
of  error  and  asymptotic  efficiency  of  the  decorrelating  receiver  for  different  near-far 
scenarios  and  estimation  error  variances.  Because  of  the  complicated  nature  of  (5.25) 
and  (5.32),  we  resort  to  Monte-Carlo  simulations. 

The  simulated  system  was  a K = 3 user  system  using  Gold  code  sequences  with 
15  chips  per  bit.  Each  user  transmitted  N = 5 bits.  The  propagation  delays  were 
(randomly)  chosen  to  be  n = 2.6253,  r2  = 8.7261,  and  r3  = 4.5129,  where  the  chip 
duration  was  Tc  — 1.  The  propagation  delay  estimates  were  assumed  to  be  unbiased 
and  to  be  independent  Gaussian  random  variables  with  the  same  standard  deviation 
crT,  i.e.,  E[ffc]  = Tfc  and  E[(ffc  — rk)2]  = a2.  The  unbiased  assumption  is  valid  if  the 
estimator  is  allowed  to  observe  the  signal  for  longer  than,  say,  50  bit  intervals  (see 
Section  4.2.5).  The  assumption  of  equal  standard  deviation,  however,  is  not  valid  if 
wk(i)  / Wi(j).  Nevertheless,  this  assumption  was  made  to  facilitate  the  simulations. 

The  average  asymptotic  efficiency  of  the  first  user’s  third  bit,  771,3,  is  plotted  in 
Figure  5.1.  The  near-far  ratio  is  defined  as  wt(j)/wi(  1)  for  l ^ 1,  and  wk(i)  was 
held  constant  during  the  data  transmission.  The  asymptotic  efficiency  was  averaged 
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Figure  5.2:  Average  error  probability  for  the  decorrelator  using  ir  for  aT  e 

{0.01,  0.02,  0.03,  0.04,  0.05,  0.1}:  (o),  single-user:  solid  line,  and  ideal  decorrelator: 
dash-dotted  line. 

over  1,500  independent  realizations  of  the  propagation  delay  estimates.  The  solid 
line  represents  77^3  for  the  ideal  decorrelator  when  fk  = rk.  Recall  that  the  ideal 
decorrelator  has  an  asymptotic  efficiency  that  is  independent  of  the  near-far  ratio. 
The  asymptotic  efficiency  for  the  decorrelator  based  on  v decreased  with  the  near-far 
ratio  as  predicted  by  (5.32).  Furthermore,  it  is  seen  that  the  larger  aT  was,  the  worse 
the  asymptotic  efficiency  became. 

Similar  conclusions  regarding  the  average  error  probability  are  drawn  from  the 
plots  presented  in  Figure  5.2.  The  probability  of  error  for  the  first  user’s  third  bit  was 
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averaged  over  500  independent  realizations  of  the  propagation  delay  estimates.  The 
solid  line  represents  the  single-user  system,  i.e.,  Q(yj2wx{i)/NQ),  and  the  dash-dotted 
line  is  the  error  probability  of  the  ideal  decorrelator.  The  performance  penalty  is 
seen  to  have  increased  with  the  near-far  ratio  and  with  aT.  As  predicted  from  (5.25) 
and  (5.33),  the  error  probability  of  the  decorrelator  exhibited  an  error  floor,  i.e., 
an  irreducible  error  as  the  SNR  goes  to  infinity.  The  position  of  the  error  floor  is 
determined  by  the  near-far  ratio  and  crT. 

5.6  Conclusions 

We  have  demonstrated  that  the  decorrelating  receiver  will  not  be  near-far  resis- 
tant in  the  presence  of  imperfect  propagation  delay  estimates.  The  performance  will 
be  impaired  by  both  intersymbol  interference  and  multiple  access  interference.  To 
maintain  acceptable  performance,  we  must  keep  the  variance  of  the  estimation  errors 
small.  This  constraint  becomes  more  strict  as  the  power  of  the  interfering  users  be- 
come higher.  Conventional  propagation  delay  estimation  does  not  work  too  well  in  a 
near- far  situation,  and  we  might  therefore  need  to  use  more  sophisticated  estimation 
algorithms. 


CHAPTER  6 

SYSTEM  MODEL  FOR  FADING  CHANNELS 


In  this  chapter  we  will  present  a model  for  a fading  channel  and  derive  the  equiv- 
alent discrete-time  complex  vector  model. 


6.1  Continuous-Time  Channel  Model 


We  will  adopt  the  channel  model  proposed  by  Turin  [54],  The  model  is  intuitively 
pleasing  and  has  been  shown  to  provide  a good  fit  to  measured  channels  [53,  14].  The 
channel  for  the  kth  user  is  modeled  as  a time-varying  filter  with  impulse  response 

Mr>  t), 

Rk(t) 

MD  *)=  Y «*,rW<5(r  - Tktr(t))  (6.1) 

r=0 

where  S(t)  is  the  Dirac  delta  function,  Rk(t)  is  the  number  of  paths  (or  rays),  ak>r(t) 
is  the  path  gain  (or  fading  process),  and  Tk<r(t)  is  the  propagation  delay.  We  assume 
that  the  channel  is  slowly  varying  compared  with  the  time  for  which  we  are  observing 
the  received  signal.  In  particular,  we  assume  that  the  number  of  paths  and  the 
propagation  delays  are  fixed,  i.e.,  Rk(t)  = Rk  and  r*>r(f)  = rkiT. 

The  channel  of  the  kth  user  is  depicted  in  Figure  6.1.  The  rth  ray  is  the  trans- 
mitted signal  delayed  with  Tk<r  and  multiplied  with  the  fading  process  ak,r(t).  The 
contribution  to  the  received  signal  from  user  k can  be  written  as 


rk(f)  = Y ar,k(t)sk(t  - rfc>r 


) \[Ypk  cos(u;ct  + Qk  r) 


(6.2) 


where  0k>r  = 9'k  - u}cTktr,  and  where  sk(t)  is  defined  by  (2.1).  We  assume  that  the 
delays  are  such  that  rfc,r  e [0,  T)  for  k = 1, 2, . . . , K and  r = 1, 2, . . . , Rk,  and  that 
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the  propagation  delays  for  a particular  user’s  rays  are  spaced  with  more  than  7). 
Furthermore,  we  will  make  the  following  assumptions: 

• {^k,r (t') } j for  k = 1,2, ...  ,K  and  r = 1,  2, . . . , Rk,  are  independent  wide-sense 
stationary  random  processes. 

• The  fading  processes  vary  slowly  in  time  compared  with  the  symbol  duration. 
That  is,  ar,k(t)  » ak,r (mT),  for  t G [mT,  (m  + 1 )T). 

• The  noise,  carrier  phase,  data,  and  fading  processes  are  statistically  indepen- 
dent. 


Figure  6.1:  Channel  model  for  the  fcth  user 


6.2  Discrete-Time  Complex  Vector  System  Model 

The  received  signal  can  be  written  as 

K Rk  

r(t)  = n(t)  + J2  Uk,r{t)sk(t  - Tk,r) \/2Pk  cos (uct  + 9ktr)  (6.3) 

A:=l  r=  1 

where  n(t)  is  a white  Gaussian  noise  waveform  with  two-sided  power  spectral  density 
No/2. 
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The  receiver  front-end  is  the  same  as  in  the  AWGN  channel  case,  i.e.,  a stan- 
dard IQ-stage  followed  by  an  integrate-and-dump  section,  as  depicted  in  Figure  2.1. 
Ignoring  double  frequency  terms,  the  equivalent  complex  received  sequence,  r{l)  — 
ri{ l)  + may  be  written  as 

K Rk  i -ij'. 

r{l)  = n(l)  + J2'EM™dQN)~  / sk(t  - r*>r)  dt  (6.4) 

fc=l  r=l  -T  J(l-\)Ti 

where 

A,r(m)  = aktr(mT)y/Fkexp(jektr)  (6.5) 

and  where  n(l ) is  a white  Gaussian  sequence  with  variance  a2  = E[|n(l)|2]  = A/g/7). 
The  received  vector,  r(m)  e GQAr,  and  the  noise  vector,  n(m)  G GQiV,  are  defined 
by  (2.5)  and  (2.6),  respectively.  The  noise  vector  will  have  the  same  distribution  as 
in  the  AWGN  case,  i.e.,  n(m)  is  a complex  Gaussian  random  vector  with  zero  mean 
and  second  moment  matrices 

E[n(p)n*(g)]  = a2IQN6K(p  - q),  E[n(p)nT(q>)]  = 0 (6.6) 

After  some  straightforward  calculations,  we  can  formulate  the  contribution  from 
the  A;th  user  to  r(m)  as  r k(m)  = Aksk(m),  where 


Ak  = [ atll  • • • 3ikaRk  ] G IR^*2^  (6.7) 

sfc  (m)  = [ sM  (m)  • • • sk> 2Rk  (m)  ]T  G G2i?fc  (6.8) 

The  elements  of  s k(m)  are 

Sk,2r-i{™)  = PkAm)z2k-i(rn)  (6.9) 

Sk,2r(m)  = (3k,r(m)z2k(m)  (6.10) 


where  z2k-i(m)  and  z2k(m)  are  defined  by  (2.9)  and  (2.10). 
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The  columns  of  Ak  are  the  contributions  from  the  kth  user’s  Rk  rays  to  r(m). 
For  instance,  if  dk(m)  = dk(m  - 1),  the  /cth  user’s  rth  ray  will  contribute  the  vector 
Pk,r(m)dk(m)a.k  2r-i-  On  the  other  hand,  if  dk{rn)  = —dk(m  — 1),  the  contribution 
will  be  (dkj(m)dk(m) aki2r.  The  form  of  ak}2r  and  a^r-i  are  defined  by  rktT  and  bk(t) 
as 


afc,2r-l 


afc,2r 


|fD  (pt,r  + 1,1)+  f 

~jT^(Pk,r  + 1)  “I)  + 


D (pk,r,  1) 


where  Tfc)J.  = pk>rTi  + 6k<r  is  such  that  pkj  is  an  integer  and  6kyT  E 
ck  and  D(r,  s)  are  defined  in  (2.14)  and  (2.16),  respectively. 

The  received  vector,  r(m),  may  be  expressed  as 


(6.11) 

ck  (6.12) 

[0,7)),  and  where 


r(m)  = As(m)  + n(m) 


(6.13) 


where 


A = [ A)  A2  •••  Ak]  e ]rQNx2R 
s(m)  = [sf(m)  s 2{m)  ■■■  s^(m)]T  E <C2R 


and  where  R is  total  number  of  rays 

R=J2Rk 

k= 1 


(6.14) 

(6.15) 


(6.16) 


CHAPTER  7 

PARAMETER  ESTIMATION  FOR  FADING  CHANNELS 


In  this  chapter,  the  estimation  problem  from  Chapter  4 is  generalized  to  fading 
channels.  The  extension  to  fading  channels  would  be  trivial  if  the  fading  processes, 
{ov(t)},  were  constant  in  time.  However,  this  is  seldom  the  case  in  practice.  There 
are  basically  two  ways  in  which  we  can  extend  the  problem. 

The  first  approach  is  to  assume  certain  distributions  for  the  fading  processes, 
define  a set  of  parameters  that  determines  these  distributions,  and  augment  the  pa- 
rameter vector  from  the  AWGN  case  with  those  parameters.  The  problem  with  this 
approach  is  that  we  must  assume  particular  distributions  for  the  fading  processes. 
This  choice  is  dependent  on  the  application  and  channel.  Moreover,  even  if  we  con- 
centrate on  a certain  environment  and  application,  the  choice  of  distributions  is  not 
obvious  [17]. 

The  second  approach  is  consider  members  of  the  set 

{A,rW  : m = 1, 2, . . . , M;  k = 1, 2, . . . , K;  r = 1, 2, . . . , Rk}  (7.1) 

to  be  unknown  and  deterministic  parameters.  The  parameter  vector  is  then  formed 
as  the  r vector  augmented  with  a2  and  the  elements  of  {/ %,r{m )}.  The  advantage 
with  this  approach  is  that  we  do  not  need  to  choose  any  particular  distribution  for 
the  fading  processes,  and  the  treatment  is  thus  more  general.  The  disadvantage  is 
that  the  parameter  vector  is  much  larger  now.  In  addition  to  a2  and  r,  we  have 
MKR  complex  parameters  in  {/ 3fc>r(ra )}.  Note  that  the  number  of  parameters  is 
proportional  to  the  length  of  the  observation  interval.  This  implies  that  the  size  of 
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the  Fisher  information  matrix  will  grow  with  the  observation  interval.  Nevertheless, 
we  will  adopt  this  approach  due  to  its  generality. 

The  original  work  of  this  chapter  is  (1)  the  formulation  of  the  parameter  esti- 
mation problem  for  fading  channels,  (2)  the  derivation  of  the  Cramer-Rao  bound, 
(3)  the  formulation  of  the  maximum-likelihood,  approximative  maximum-likelihood, 
and  MUSIC  estimators,  and  (4)  the  derivation  of  the  optimum  dimension  reduction 
transform. 


7.1  Cramer-Rao  Bound 

The  derivation  of  the  Cramer-Rao  bound  (CRB)  is  somewhat  lengthy,  and  we 
therefore  defer  the  details  to  Appendix  A. 

The  basic  assumptions  made  for  the  AWGN  case  in  Section  4.1  are  made  here 
as  well;  a 2 and  rktT  are  assumed  to  be  unknown  and  deterministic  parameters  that 
are  fixed  during  the  observation  interval.  We  assume  that  rfc>r  ^ pT{  for  an  in- 
teger p,  which  guarantees  that  A is  differentiable  with  respect  to  rfc,r  (see  Sec- 
tion A.l).  Furthermore  we  assume  that  {pk,r(m)}  are  unknown  and  deterministic 
for  m = 1, 2, . . . , M,  k = 1, 2, . . . , K,  and  r = 1, 2, . . . , Rk. 

We  denote  the  real  part  of  a complex  quantity  x by  x and  the  imaginary  part  by 
x.  The  parameter  vector  <j>  e ]R1+2RM+r  [s  defined  as 


<t>  = 

[a2  (3t  tt]t 

(7.2) 

where 

P = 

= [£T(1) 

~T  - T 

0 (1)  0 (2) 

Pt(2)  •••  pT(M) 

0T(M)  f s IR2KM  (7.3) 

(3(m)  = 

= [0  fM 

02  (m)  ■■■ 

(3TK(m)]T  enR 

(7.4) 

AbM  = 

= lPk,i(m) 

Pk,  2{m)  ■ ■ ■ 

Pk,Rk{m)]T  G IRKfc 

(7.5) 
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and 


r = [rj  tt2  •••  t£]tgIR*  (7.6) 

Tk  = [rk,  1 rK 2 • • • ]T  G IR^  (7.7) 


The  Fisher  information  matrix,  J G ]R(1+2HM+iJ)x(1+2fiM+«))  js  found  as 


J = E 


/ <91nL(r)\  / dlnL(r)' 

V 50  J { d<t> 


(7.8) 


where  lnL(r)  is  the  log-likelihood  function  (conditioned  on  z = [zr(l)  • - -z T(M)]T) 
°f  r = [rT(l)  • • -rT(M)  ]T  with  respect  to  (f>.  Thus,  the  resulting  CRB  should  be 
interpreted  as  being  conditioned  on  both  the  data  sequences  and  the  fading  processes. 
It  is  easy  to  show  that  the  likelihood  function  conditioned  on  z is 

1 / 1 M \ 

= [27T a*)MQN  exp  £ llrM  - As(m)||2J  (7.9) 

and  the  log-likelihood  function  (ignoring  constants  independent  of  <f>)  is 

1 AT 

lnL(r)  = - MQNlna 2 - — £ ||r(m)  - As(m)||2  (7.10) 

° m=  1 

In  Appendix  A it  is  shown  that 


dlnL(r) 
da 2 

dlnL(r) 

d/3(m) 

<91nL(r) 

<9/3  (m) 

<91nL(r) 

dr 


MQN  1 " ..  , ...s 


cr^  cr4 


m=l 


Re  {Z*(m)A*n(m)} 


a' 


Im  {Z*(m)A*n(m)} 


2 M 

— ^ Re  {T*  (m)\I>*n(m)} 

° m=  1 


where  Vl>  G IRQ^*2^  is  defined  as 


(7.11) 

(7.12) 

(7.13) 

(7.14) 


^ = tyK] 


(7.15) 
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**  = [ Jfe,2  • • • </>jfc,2**  ] € lRQiVXjR* 

, <9afc2r-1  1 r , 

^fc,2r-l  = —fit = TF  [D(PA:,r  + 1,  1)  ~ D(Pfc,r,  1)]  Ck 

u 1 k,r  J-i 

'/’Mr  = ^ = 4 [D(p*,r  + 1,  -1)  - D(PI,,  -1)1  Ct 

U • k,r  i 


(7.16) 

(7.17) 

(7.18) 


and  where  Z t(m)  E {-1,0, 1}2R**R‘,  Z(m)  £ {-1,0,  l}1***,  rt(m)  € (C2*****,  and 
r(m)  £ (D2R  ’ 11  are  defined  as 


<9s(  77l) 

Z(m)  = aft,  x = diag(z1(m), . . . , Z*(m)) 
' z2k-i(m)  if  * = 2j  - 1 

22fc(m)  if  i = 2) 

0 otherwise 

T(m)  = diaglTitm), . . . , rK(m)) 

sk,i(m)  if*  e {2.7 -1,27} 


(Z  fc(m))(t,i)  = < 


(7.19) 

(7.20) 

(7.21) 

(7.22) 


(r*(m))(i,j)  = _ 

0 otherwise 

The  Fisher  information  matrix,  J,  can  now  be  formed  by  taking  the  expected  value 
of  all  possible  outer  products  of  the  gradients  (7. 1 1)-(7. 14) . After  some  manipulation 
we  conclude  that 


J = 


o o 


0 Jss  J ST 


0 


JT  T 

JST  JTT 


(7.23) 


where 


Jss  = diag(Jss(l), . . . , JSS(M)) 


Jss(m)  = — 


Re{Z*(m)A*AZ(m)}  - Im{Z*(m)A*AZ(m)} 
- Imr{Z*(m)A*AZ(m)}  Re{Z*(ra)A*AZ(m)} 


(7.24) 

(7.25) 


and 


jSx  = [jrT(i) 


J L(M)}1 


(7.26) 
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Jsr(m)  = 


Re{Z*(m)A*^r(m)} 

Im{Z*(m)A*\Err(m)} 


M 


Jrr  = ^ £ Re{r*(m)^*^r(m)} 


(7.27) 


(7.28) 


To  obtain  the  CRB  for  an  estimator  of  t,  we  invert  the  Fisher  information  matrix 

0 0 

x (7.29) 


J-L  = 


MQN 

0 x 


0 x CRB(r) 

where  CRB(r)  € URxR  is  the  lower  right-hand  R x R block  of  J"1.  In  (7.29),  x 
denote  matrices  that  of  no  particular  interest  to  us.  It  is  shown  in  Section  A. 2 that 


CRB(r)  = [JTT  - JfTJss1Jsr]  1 < E[(f  - r)(f  - r)T]  (7.30) 

where  f is  an  unbiased  estimator  of  r.  After  some  algebraic  manipulations  we  obtain 

2 M 

CRB-1(t)  = -•£  Re{r*(m)<rPiz(m)*r(m)}  (7.31) 

u m= 1 

where  P^Z(m)  is  the  projection  matrix  onto  [range (AZ(ra) )]x, 

PAZ(m)  = Iqjv  - AZ(m)(Z*(m)A*AZ(m))’1Z*(m)A*  (7.32) 

We  will  now  show  that  the  CRB  is  independent  of  the  near-far  problem,  i.e.,  the 
CRB  for  an  estimator  of  the  propagation  delays  for  kth  user’s  rays  is  independent  of 
the  received  power  for  the  other  K — 1 users.  To  show  this,  consider  the  square-root 
of  the  mean  power  of  the  kth  user’s  rth  ray, 

Qk,r  = VE[I/Mm)|2]  (7.33) 

and  let  Q e JRfixit  be  defined  as 


Q = diag(Q1,Q2,...,Q*) 

Qfc  = diag (gktU  qk>2, . . . , qKR J e URkXRk 


(7.34) 

(7.35) 
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Moreover,  let  rnorm(77i)Q  — r(77i).  Now,  if  all  rays  arrive  with  unit  mean  power,  the 
CRB  is 

2 M 

CR-Bnorm(r)  = — - ^ R-e{rnorm(m),®r  PAz(m)^Pnorm(^)}  (7.36) 

u m= 1 

From  (7.31)  and  (7.36)  we  see  that 

CRB(r)  = Q_1CRBnorm(r)Q-1  (7.37) 

and  from  this  equation  we  conclude  that  the  CRB  on  an  estimator  of  a particular 
ray’s  propagation  delay— given  by  the  appropriate  diagonal  element  of  CRB(r) — 
is  independent  of  the  mean  power  of  the  other  rays.  In  other  words,  the  CRB  is 
independent  of  the  near-far  problem.  This  result  is  important  since  it  tells  us  that 
there  may  exist  near-far  resistant  estimators  of  r. 


7.2  Estimation  Algorithms 

The  algorithms  presented  here  are  rather  straightforward  modifications  of  the 
estimators  proposed  in  Section  4.2. 


7.2.1  Maximum-Likelihood 


The  maximum-likelihood  (ML)  estimator  for  the  fading  channel  case  is  completely 
analogous  to  the  ML  estimator  for  the  AWGN  channel  presented  in  Section  4.2.1.  The 
ML  estimator  is  found  by  minimizing  the  negative  of  the  log-likelihood  function  (7.10) 
with  respect  to  z,  defined  by  (4.23),  and  </>, 


r - ML 

0 

~ML 


M 

= argmin  ^ ||r(m)  - As(m)||2 

, TO=  1 


(7.38) 


The  optimization  of  the  ML  cost  function  is  carried  out  over  1 + 2 MR  + R contin- 
uous parameters  in  0,  and  KM  discrete  parameters  in  z.  The  complexity  is  therefore 
enormous,  even  for  rather  modest  values  of  the  observation  interval  and  number  of 
users.  We  therefore  conclude  that  the  ML  estimator  is  of  small  practical  value.  We 
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also  note  that  the  ML  estimator  cannot  reach  the  CRB  as  long  as  the  oversampling 
factor  Q < oo.  The  reason  for  this  is  that  we  cannot  make  consistent  estimates  of  /3 
with  a finite  Q,  regardless  of  the  length  of  the  observation  interval  [46]. 

7.2.2  Approximative  Maximum-Likelihood 


We  can  significantly  reduce  the  complexity  of  the  ML  estimator  by  omitting  the 
dependency  of  the  log-likelihood  function  on  z.  This  can  be  done  in  the  same  manner 
as  in  Section  4.2.2,  by  considering  s(m)  to  be  continuous,  unknown  and  deterministic. 
Note  that  the  structure  of  s(m)  implied  by  (6.9)  and  (6.10)  is  then  not  used.  We  can 
form  the  approximative  maximum-likelihood  estimate  of  r as 


-AML 

T = 


M 


= argmin  ^ ||r(m)  - s(m) 

m=  1 

= arg  min  trace 


s(m)=At  (r)r(m) 


(7.39) 


where  Pa(t)  — I QN  - A(r)Af(r)  is  the  orthogonal  projection  matrix  onto  the  sub- 
space [range(A(T))]x,  and  where  RM  is  the  sample  correlation  matrix 

l M 

= T7  Y,  r(m)r*(m)  (7.40) 

1V1  m=l 

7.2.3  MUSIC 


The  MUSIC  cost  function  for  fading  channels  is  identical  to  the  MUSIC  cost 
function  for  the  AWGN  channel: 


_ ||E;bfc)1(r)||2  ||E;bfc,2(r)||2 

‘•'fc.MlH'  ) — — , ,,r, r 


.|b*.rW||2 

where  the  bfcji(r)  and  bfci2(r)  are  defined  as 


IK2(r)||2 


bfc,i(r)  = 

bfcl2(r)  = 


-D(p  + l,l)+  l--jD(p,l) 


Cfc 


-D(p  + 1,-1)+  1--  D(p,-1) 


c k 


(7.41) 


(7.42) 

(7.43) 
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where  t pT \ + <5  is  such  that  p is  an  integer  and  6 = [0,  2}).  E„  is  found  from  the 

eigenvalue  decomposition  of  RM, 

Rm  = ESASE*  + E„A„E;  (7.44) 

where  the  columns  of  En  are  the  eigenvectors  corresponding  to  the  QN  - 2 R smallest 
eigenvalues  of  RM. 

The  set  T of  candidate  estimates  of  rk  is  formed  exactly  as  in  Section  4.2.3, 
and  the  elements  of  the  MUSIC  estimate  ffu  are  found  as  the  elements  of  T that 
corresponds  to  the  Rk  smallest  values  of  Jk,Mv{j). 

The  only  significant  difference  between  this  algorithm  and  the  MUSIC  estimator 
for  AWGN  channels  is  that  the  noise  subspace  here  has  dimension  QN-2R,  and  that 
we  pick  Rk  estimates  from  T ■ Recall  that  in  the  AWGN  case,  the  noise  subspace  has 
dimension  QN  — 2K,  and  that  we  pick  only  one  estimate  from  T. 

7.2.4  Numerical  Results 

The  simulated  system  was  a 2-user  system  with  Q = 1,  N = 15  chips  per  bit, 
SNRi  = 2 Ebyl/N0  - 15  dB,  and  2 rays  per  user.  The  rays  from  each  user  were 
assumed  to  have  the  same  (unit)  mean  power.  The  fading  processes  are  set  to  be 
the  standard  Rayleigh  fading  processes  with  a Doppler  frequency  of  2.60  x 10~3/T  = 
83.3  Hz  [19,  12].  This  Doppler  frequency  would  result  if  the  system  had  a carrier 
frequency  of  900  MHz,  a data  rate  of  32  kbit/s,  and  a transmitter-receiver  relative 
speed  of  100  km/h. 

For  each  Monte-Carlo  run,  independent  realizations  of  {dk(m),  ak}T(m),  n(m)} 
were  used.  The  propagation  delays  were  (randomly)  set  to 

r = Tc  [3.70  5.20  4.12  8.16]T  (7.45) 

The  received  vector  was  observed  for  M = {200,  300,  400,  500}  symbols,  and  estimates 
of  rM  were  computed  by  the  MUSIC  estimator.  The  variance  of  the  MUSIC  algorithm 
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Figure  7.1:  Average  CRB:  (o),  variance  of  the  MUSIC  estimator:  (x). 


was  estimated  as  the  sample  variance  found  from  103  independent  Monte-Carlo  trials. 
Since  the  Cramer- Rao  bound  is  dependent  on  the  realizations  of  dfc(m)  and  akr(jn), 
the  CRB  plotted  in  Figure  7.1  is  the  average  CRB.  The  near-far  ratio,  P2/Pu  was 
varied  from  0 dB  to  20  dB,  in  order  to  investigate  the  near-far  resistance  of  the  MUSIC 
estimator. 

The  average  CRB  and  the  estimated  variance  of  the  MUSIC  estimator  for  different 
near- far  ratios  are  plotted  in  Figure  7.1.  Recall  that  the  CRB  is  invariant  to  the  near- 
far  ratio.  The  plot  indicates  that  the  MUSIC  estimator  has  the  same  property. 

The  strategy  outlined  in  Sections  7.2.3  and  4.2.3  calls  for  searching  all  QN  bins 
T £ \pT{,  (p  -I-  l)Tj)  of  the  cost  function.  The  estimates,  and  f^,  are  picked  as 
the  two  elements  of  T that  corresponds  to  the  two  smallest  values  of  the  cost  function. 
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Figure  7.2:  MUSIC  cost  function  for  M = 300  and  P2/P1  = 0 dB. 

The  procedure  is  illustrated  in  Figure  7.2,  in  which  a typical  cost  function  is  plotted. 
The  circles  indicate  the  members  of  T.  In  this  case,  the  MUSIC  algorithm  will  pick 
the  right  bins,  i.e.,  the  estimates  will  be  close  to  3.7TC  and  5.2TC.  However,  if  the 
cost  function  had  been  more  perturbed,  we  might  have  chosen  the  wrong  bin  for  one 
or  both  of  and  . This  type  of  error,  referred  to  as  an  outlier  error , can  be 
quite  large.  On  the  other  hand,  if  we  choose  the  correct  bin,  or  restrict  the  search  to 
be  inside  the  correct  bin  (as  the  case  will  be  in  tracking),  the  error  is  much  smaller. 
To  make  an  meaningful  comparison  with  the  CRB,  the  outlier  errors  were  excluded 
from  the  data  before  computing  the  variance  of 

The  relative  number  of  outlier  errors  can  be  found  in  Table  7.1.  Note  that  these 
numbers  are  not  very  reliable  since  only  103  Monte-Carlo  runs  were  made.  However, 
the  table  indicates  that  for  large  Ad , there  will  be  only  a few  outliers.  The  reason  why 


103 


outlier  errors  are  frequent  for  small  M is  that  the  ray,  whose  delay  is  to  be  estimated, 
might  be  in  a deep  fade  during  the  entire  observation  interval,  and  the  signal-to-noise 
ratio  is  thereby  made  very  small.  Thus,  the  cost  function  will  be  severely  perturbed 
and  an  outlier  error  may  occur. 


Table  7.1:  Relative  number  of  outlier  errors. 


M = 200 

300 

400 

500 

P2/P1  = 0 [dB] 

0.245 

0.079 

0.025 

0.005 

10  [dB] 

0.310 

0.090 

0.021 

0.008 

20  [dB] 

0.328 

0.113 

0.030 

0.008 

7.3  Dimension  Reduction 

In  Section  4.3,  we  motivated  the  need  for  complexity  reduction  through  dimension 
reduction.  Complexity  is  of  course  a concern  here  as  well.  In  this  section  we  will 
therefore  derive  the  optimum  dimension  reduction  transform  for  fading  channels.  As 
it  turns  out,  the  optimum  transform  is  very  similar  to  the  optimum  transform  for  the 
AWGN  channel  as  presented  in  Section  4.3. 

Recall  that  the  optimum  transform  is  defined  as  a matrix  V e (pQ^xL  with  the 
property  that  the  CRB  on  an  estimator  of 

4>'  = [0T  rT]T  (7.46) 

based  on  the  data  {r(l), . . . , r(M)}  is  the  same  as  the  CRB  on  an  estimator  based 
on  the  data  {V*r(l), . . . , V*r(M)}. 

Suppose  V*V  = IL,  then  the  reduced  dimension  received  vector  may  be  expressed 
as 


rv(m)  = V*r  (m)  = Avs(m)  + n v{m) 


(7.47) 
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where  Ay  — V*A  and  nv(m)  = V*n(m)  is  a complex  random  Gaussian  vector  with 
zero  mean  and  second  moment  matrices 


E[ny(m)riy(A:)]  = V*  E[n(ra)n*(fc)]V  = a2lL5K(k  - m)  (7.48) 

E[nv(m)n£(fc)]  = 0 (7.49) 


Thus,  the  log-likelihood  function  of  ry  — [iy(l)  • • • Ty(M)  ] with  respect  to  (f>  and 
conditioned  on  z (ignoring  constants  independent  of  0)  is 

Y m 

In  L(tv)  = —M L In  a2 J2  IMm)  - AKs(m)||2  (7.50) 

° m= 1 


We  assume  that  the  columns  of  Av  are  linearly  independent  for  all  possible  real- 
izations of  r,  and  we  exclude  the  case  when  5k>r  = 0 for  any  k = 1,2,...,K  and 
r = 1,  2, . . . , Rk.  Then  the  Fisher  information  matrix  for  an  unbiased  estimator 
based  on  rv  will  be  as  in  (7.23),  with  the  appropriate  substitutions,  A t—  Av, 
if!  <-  <bv  = V*^,  and  QN  «-  L: 


r ML 

a4 

0 

0 ' 

j V = 

0 

*JV,ss 

Jy,sr 

_ 0 

T T 

JV,ST 

Jv,TT  . 

(7.51) 


where 


Jv,ss  = diag(Jy,S8(l), . . . , Jy>ss(M)) 


Jv,ss  (m)  = — 


Re{Z*(m)AyAyZ(m)} 

- lmT {Z* (m)AyAvZ(m)} 


and 


(7.52) 


- lm{Z*(m)AyAvZ(m)} ' 
Re{Z*(m)AyAyZ(m)} 


(7.53) 


Jv'.sr  — [ Jv,sr(l)  ‘ ‘ ■ f 


Jy,s  t(^)  — 


<7* 


Re{Z*(m)A^vr(m)} 

.Im{Z*(ro)A^vr(m)} 


* V,T  T 


a 


2 m 

£ Re{r*(m)^®vr(m)} 


m=  1 


(7.54) 

(7.55) 


(7.56) 
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As  noted  in  Theorem  3.5,  if  V is  such  that 


V*V  = lL 


(7.57) 


range([  A \F])  C range(V) 


(7.58) 


then  VV*A  = A,  VV*\F  = \I>,  and 


AyAy  = A*VV*A  = A*  A 


(7.59) 


A*vVv  = A*W*V  = A*^ 


(7.60) 


'Ty’IV  = $*VV*f  = T'*'!' 


(7.61) 


It  is  now  easy  to  see  that  if  V satisfy  (7.57)  and  (7.58),  then  the  Fisher  information 
matrices  (7.51)  and  (7.23)  are  identical  (with  the  exception  of  the  (1,1)  element). 
Thus,  the  CRB  on  estimators  of  </>'  will  be  identical  in  both  cases,  and  V is  optimum. 


In  this  chapter,  we  extended  the  parameter  estimation  results  from  Chapter  4 to 
the  fading  channels  scenario. 

The  Cramer-Rao  bound  was  presented  in  Section  7.1,  and  it  was  shown  that 
the  CRB  on  an  estimator  of  a particular  user’s  propagation  delay  is  independent  of 
the  other  users’  mean  received  powers.  Thus,  the  CRB  is  invariant  to  the  near-far 
problem,  and  there  may  exist  near-far  robust  estimators. 

In  Section  7.2,  the  maximum-likelihood  (ML),  approximative  maximum-likelihood 
(AML),  and  MUSIC  estimators  were  modified  to  function  in  fading  channels.  The  ML 
estimator  will  not  attain  the  CRB  if  the  oversampling  factor  is  finite.  The  Cramer- 
Rao  bound  for  the  fading  channels  might  therefore  not  be  as  tight  as  it  was  for  the 
AWGN  channel. 

In  Section  7.2.4,  it  was  experimentally  shown  that  the  MUSIC  estimator  is  robust 
against  the  near- far  problem.  Because  of  the  highly  nonlinear  nature  of  the  cost 


7.4  Conclusions 
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function,  the  MUSIC  algorithm  is  susceptible  to  outlier  errors.  An  outlier  error 
occurs  when  the  algorithm  picks  the  wrong  bin,  i.e.,  the  estimate  is  chosen  to  be  in 
[pTi,  ( p + 1 )Ti),  where  p is  an  integer,  when  the  true  propagation  delay  lies  outside 
this  interval.  The  resulting  error  can  be  very  large.  However,  by  allowing  the  MUSIC 
estimator  to  observe  the  signal  for  a sufficiently  long  period  of  time,  the  frequency  of 
outlier  errors  can  be  made  negligible. 

The  optimum  form  of  dimension  reduction  is  found  in  Section  7.3.  The  optimum 
transform  requires  knowledge  of  the  A and  \I>  matrices,  which,  in  turn,  are  defined 
by  the  K users’  code  waveforms  and  propagation  delays  for  all  rays.  Thus,  since  the 
propagation  delays  are  unknown,  the  optimum  transform  is  unknown.  Nevertheless, 
we  may  still  be  able  to  use  the  optimum  transform  in  a practical  manner  as  argued 
in  Section  4.3.2. 


CHAPTER  8 

SUMMARY  AND  CONCLUSIONS 


8.1  Summary 

The  near-far  problem  is  a serious  threat  the  performance  of  asynchronous  DS- 
CDMA  communication  systems.  In  a near-far  situation,  the  interference  will  com- 
pletely destroy  the  ability  of  the  standard  receiver  to  detect  a weak  user.  Several 
near-far  resistant  receivers  exist;  however,  they  are  typically  rather  computationally 
complex  and  require  accurate  synchronization.  It  is  therefore  of  great  importance  to 
find  receivers  and  propagation  delay  estimators  that  are  both  near-far  resistant  and 
feasible  for  implementation. 

A convenient  vector  model  for  an  asynchronous  DS-CDMA  system  operating  over 
additive  white  Gaussian  noise  (AWGN)  channels  was  derived  in  Chapter  2.  The 
system  model  reveals  the  underlying  structure  of  the  system  and  is  easily  implemented 
in  software  for  computer  simulations,  e.g.,  in  the  MATLAB  language  [26]. 

The  minimum  mean-squared  error  (MMSE)  receiver  was  defined  in  Chapter  3. 
Using  the  system  model  from  Chapter  2,  it  was  shown  that  the  MMSE  receiver  is 
near-far  resistant  and  is  optimum  when  the  interference  is  negligible.  The  MMSE 
receiver  has  relatively  low  complexity  and  may  therefore  be  implemented.  However, 
if  complexity  is  still  a concern,  we  can  employ  dimension  reduction  to  further  decrease 
the  amount  of  hardware  and  software  needed  for  implementation. 

The  issue  of  dimension  reduction  of  the  MMSE  receiver  was  investigated  in  Sec- 
tion 3.6.  The  optimum  transform — in  the  probability  of  error  sense — was  shown  to 
be  dependent  on  side-information  which  is  not  assumed  to  be  known  to  the  receiver 
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(most  notably,  the  propagation  delays  for  all  users).  Optimum  dimension  reduction 
is  therefore  not  directly  suitable  for  implementation.  Several  suboptimum  schemes 
and  one  asymptotically  optimum  scheme  were  therefore  proposed  in  Section  3.6. 

The  conclusions  drawn  in  Chapter  3 were  that  the  MMSE  receiver  is  attractive 
due  to  its  simplicity  and  near-far  properties,  and  that  significant  dimension  reduction 
may  be  obtained  if  complexity  is  a greater  concern  than  performance. 

To  maximize  performance  we  need  to  synchronize  the  receiver  with  the  transmit- 
ted signals.  It  was  shown  in  Chapter  5 that  the  popular  near-far  resistant  decorre- 
lating  receiver  is  very  sensitive  to  synchronization  errors.  Furthermore,  as  shown  in 
Chapter  4,  the  standard  algorithm  for  propagation  delay  estimation  fails  in  a near- 
far  situation.  This  indicates  the  need  for  accurate  near-far  robust  propagation  delay 
estimation  algorithms. 

The  parameter  estimation  problem  for  AWGN  channels  was  treated  in  Chapter  4. 
Firstly,  we  presented  the  Cramer- Rao  bound  (CRB).  The  CRB  bounds  the  accuracy 
of  any  unbiased  estimator  and  serves  as  the  optimality  criterion.  It  was  shown  that 
the  CRB,  asymptotically  as  the  observation  interval  goes  to  infinity,  is  independent 
of  the  near-far  problem.  Thus,  there  is  no  fundamental  reason  for  why  propagation 
delay  estimators  should  be  sensitive  to  the  near-far  problem.  Secondly,  the  optimum 
maximum-likelihood  (ML)  estimator  was  derived  and  shown  to  be  very  complex.  Two 
suboptimum  algorithms,  the  approximative  maximum-likelihood  (AML)  and  MUSIC 
estimators,  were  therefore  proposed.  These  algorithms  were  experimentally  shown  to 
be  robust  against  the  near-far  problem  and  to  have  similar  performance.  The  MUSIC 
algorithm  is  less  complex  and  should  therefore  be  the  method  of  choice. 

The  complexity  of  the  parameter  estimators  can  be  reduced  by  dimension  reduc- 
tion. The  optimum  linear  dimension  reduction  transform  for  the  parameter  estima- 
tion problem  was  derived  in  Section  4.3.  Significant  reduction  in  complexity  may  be 
attained  by  the  optimum  transform;  however,  the  transform  requires  knowledge  of 
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the  users’  (unknown)  propagation  delays  and  code  waveforms.  Thus,  the  optimum 
transform  is  mostly  of  theoretical  value. 

The  results  mentioned  above  are  for  a system  operating  over  AWGN  channels. 
However,  a radio  channel  can  seldom  be  accurately  modeled  as  an  AWGN  channel. 
In  Chapter  6,  we  therefore  extended  the  system  model  to  fading  channels.  The 
fading  channel  was  modeled  as  a multiray  channel  with  nonuniform  spacing  between 
rays.  This  model  is  very  general  and  has  been  used  by  many  other  researchers  in 
the  field.  Based  on  the  fading  channel  model,  the  parameter  estimation  problem  was 
reformulated  in  Chapter  7.  The  CRB  for  fading  channels  was  presented  in  Section  7.1 
and  shown  to  be  independent  of  the  near- far  problem.  It  may  therefore  be  possible 
to  find  near-far  resistant  estimators  for  fading  channels. 

The  ML,  AML,  and  MUSIC  estimators  were  modified  to  fit  the  fading  channel 
scenario  in  Section  7.2.  The  AML  and  ML  estimators  are  even  more  complex  in  the 
fading  case,  and  furthermore  the  ML  estimator  will  not  be  optimum  if  the  oversam- 
pling factor  is  finite.  The  MUSIC  estimator  is  only  slightly  more  complex  in  the 
fading  case  and  was  experimentally  shown  to  be  robust  against  the  near-far  problem 
in  Section  7.2.4. 

Dimension  reduction  for  the  parameter  estimation  problem  can  be  employed  for 
fading  channels  as  well,  and  the  optimum  transform  was  derived  in  Section  7.3.  The 
optimum  transform  requires  basically  the  same  side-information  as  in  the  AWGN 
case — i.e.,  the  users’  code  waveforms  and  propagation  delays — plus  knowledge  of  the 
number  of  rays  in  each  user’s  channel. 

8.2  Contributions 

The  main  contributions  made  in  this  dissertation  were: 

• Introduction  of  the  discrete-time  complex  vector  model  which  reveals  the  un- 
derlying structure  of  a DS-CDMA  system  and  facilitates  computer  simulations. 
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• Proofs  of  the  behavior  of  the  MMSE  receiver  when  the  noise  or  interference 
disappear. 

• Derivation  of  the  optimum  dimension  reduction  transform  for  the  MMSE  re- 
ceiver. 

• Formulation  of  several  practical  dimension  reduction  transforms  for  the  MMSE 
receiver  that  outperform  previously  known  schemes. 

• Analysis  of  the  sensitivity  of  the  decorrelating  receiver  to  propagation  delay 
estimation  errors. 

• Derivation  of  the  Cramer-Rao  bound  for  both  AWGN  and  fading  channels. 

• Formulation  of  several  practical  near-far  robust  propagation  delay  estimators 
for  both  AWGN  and  fading  channels. 

• Derivation  of  the  optimum  dimension  reduction  for  the  parameter  estimation 
problem  for  both  AWGN  and  fading  channels. 

8.3  Future  Work 

There  are  many  natural  extensions  to  the  problems  studied  in  this  dissertation. 

In  Chapter  3,  we  described  the  linear-conjugate-linear  (LCL)  modification  to  the 
MMSE  receiver.  This  modification  is  very  attractive,  since  it  gives  better  performance 
for  a lower  complexity.  It  is  therefore  of  interest  to  extend  the  results  for  the  standard 
MMSE  receiver  to  the  LCL-MMSE  receiver. 

The  MMSE  receiver  was  analyzed  under  the  assumption  that  the  “adaptive  al- 
gorithm” in  Figure  3.2  can  find  the  Wiener-Hopf  solution  exactly.  In  reality,  this 
will  not  be  exactly  true.  However,  in  the  AWGN  case,  we  can  make  the  deviation 
from  the  Wiener-Hopf  solution  arbitrarily  small  by  using  a least-mean  squared  (LMS) 
algorithm  with  a sufficiently  small  step  size  [15].  In  fading,  we  ideally  would  like  for 
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the  MMSE  receiver  to  update  its  receiver  vector  to  fit  the  interference  situation  on 
a vector-sample-by-vector-sample  basis.  This  means  that  the  adaptive  filter  has  to 
track  a time- varying  solution.  However,  for  typical  channels,  the  scenario  changes  too 
fast  for  the  LMS  algorithm.  Modification  to  the  signaling  format  or  to  the  adaptive 
algorithm  may  therefore  be  required.  To  the  author’s  knowledge,  no  successful  modi- 
fication has  yet  been  proposed.  Attempts  have  been  done  by  using  differential  BPSK 
and  the  recursive  least-squares  (RLS)  algorithm  [3,  62].  Differential  BPSK  is  used  to 
decrease  the  sensitivity  of  the  receiver  to  the  time- varying  phase  of  the  desired  signal, 
and  the  use  of  the  RLS  algorithm  is  proposed  in  order  to  improve  tracking.  However, 
even  though  RLS  has  faster  convergence  than  the  LMS  algorithm  in  a stationary  en- 
vironment, it  is  well-known  that  RLS  has  roughly  the  same  tracking  ability  as  LMS 
in  a nonstationary  environment  [15]. 

Another  complication  with  the  fading  channel  is  that  the  number  of  rays  is  gen- 
erally unknown.  However,  in  Chapter  7,  we  assumed  that  this  number  was  known  to 
the  estimation  algorithms.  In  practice,  we  must  estimate  the  number  of  rays,  e.g., 
by  comparing  the  singular  values  of  the  sample  correlation  matrix  with  a threshold. 
Also,  the  propagation  delays  will  typically  change  with  time.  Naturally  we  would  like 
to  track  these  changes.  Future  work  will  therefore  include  the  study  of  algorithms 
for  estimating  the  number  of  rays  as  well  as  a recursive  formulation  of  the  MUSIC 
algorithm  for  tracking. 

The  optimum  transform  for  dimension  reduction  shows  the  potential  for  signif- 
icant complexity  reduction.  However,  since  the  optimum  transform  is  unknown,  it 
is  of  interest  to  formulate  practical,  but  probably  suboptimum,  dimension  reduction 
transforms  and  study  their  performance. 


APPENDIX  A 

DERIVATION  OF  THE  CRAMER-RAO  BOUND 

For  a general  discussion  of  the  Cramer- Rao  bound  (CRB),  see  Scharf  [40]  or  Van 
Trees  [55].  The  derivation  of  the  CRB  for  the  AWGN  channel,  in  Section  A.l,  and  for 
the  fading  channels,  in  Section  A. 2,  follows  closely  the  procedure  presented  by  Stoica 
and  Nehorai  [46], 

A.l  CRB  for  the  AWGN  Channel 

We  start  by  repeating  some  key  equations  from  Chapter  2.  The  received  vector 
for  the  mth  data  interval  is 

r(ra)  = A(t)B(7,  0)z(ra)  + n(m)  (A.l) 

where  11(777,)  is  a complex  Gaussian  random  vector  with  zero  mean  and  second  moment 
matrices 


E[n(p)n*(9)]  = a2lQN6K(p  - q)  (A.2) 

E[n(p)nr(9)]  = 0 (A. 3) 


Conditioned  on  0 and  z(m),  the  received  vector  r(m)  will  be  complex  Gaussian  vector 
with  mean 


E[r(m)  | 0,z(t7j)]  = 

ABz(m) 

(A- 4) 

and  second  moment  matrices 

E 

(r (p)  ~ ABz(p))(r(9)  - ABz(g))* 

0,  z(m)J  = ct21qN5}<(p  — q) 

(A.5) 

E 

(r (p)  ~ ABz(p))  (r (q)  - ABz (?))T  | 

4>,z(m)  =0 

(A.6) 
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The  probability  density  function  for  r(m)  conditioned  on  4>  and  z(m)  is  called  the 
likelihood  function  for  r(m)  and  is  denoted  by  L(r(m)).  The  likelihood  function  can 
be  written  as 


i(r(m))  = ^"det1(gnew)exp  (^l|r(m)  - ABzMII2) 

= (TO‘)g«  exp  (~~?llr(m)  ~ ABz(m)||2)  (A.7) 

Since  r(p)  and  r (q)  are  uncorrelated  complex  Gaussian  random  vectors  for  p ^ q,  the 
likelihood  function  of  r = [rT(l)  rr(2)  • • • r r(M)  ]T  may  be  written  as 

M 

£(0  = n L(rM)  (a. s) 

m=  1 

Taking  the  natural  logarithm  of  L( r)  and  ignoring  terms  that  are  independent  of  cf) 
yields  the  log-likelihood  function 

i M 

lnL(r)  = —MQNlna2 V'  Ilr("r)  — ABz(m)||2  (A. 9) 

a m= i 

A. 1.1  Differentiating  the  Log-Likelihood  Function 


We  are  now  ready  to  start  computing  the  gradient  d\n  L(r)/d(f).  We  will  do  this 
in  four  steps  by  computing  the  gradient  of  lnL(r)  with  respect  to  a2,  7,  6,  and  r, 
respectively,  and  then  form  dlnL(r)/d(f)  as 


dlnL(r) 

d(f) 


d In  L(r) 
da 2 

dlnL(r) 

07 

dlnL(r) 

dd 

dlnL(r) 


(A.10) 


dr 

The  information  matrix  (4.4)  can  be  expressed  as  a 4 x 4 block  matrix  where  each 
block  is  the  expected  value  of  the  outer  product  of  two  gradients.  For  instance,  the 
(2,  3)  block  is 


(d\nL\  fd\nL\T 

V d~i  ) V dQ  ) 


J -yO  — E 


(A. 11) 
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Note  that  the  information  matrix  is  real  symmetric  (and  positive  semidefinite),  and 
therefore  the  (3,  2)  block  is  simply  the  (2,  3)  block  transposed. 

We  will  find  the  following  results  useful. 

Result  1 Let  x G C"  be  a function  of  0 = [9X  62  ■■■  dm]T  G IRm,  and  suppose 

A G <Dnxn  is  a constant  matrix.  Then 


d_ 

dO 


'dx 

dO 


x*  Ax  =1^1  Ax  +(^1  Arx 


'dx\T 

dO 


where 


dx  _ dx  dx  dx 

dO  = ld^  W2  8K 


G <Dr 


(A.  12) 


(A- 13) 


Result  2 Let  x G C"  be  a function  of  6 G IRm,  and  suppose  a G C™  is  a constant 
vector.  Then 


o_  * _ (dA' 

do*  a (dflj  a 

s . _ ( dx.\T  - 
aeax  lae)  a 


(A. 14) 
(A. 15) 


Result  3 Let  X G Cnxfc  be  function  of  9 G IR,  and  suppose  a G (Cn  is  a constant 
vector.  Then 

|a-x‘xa  = a-(f)'xa  + a-X-(f)a  (A.16) 

Computing  din L(r)/da2 

We  start  by  differentiating  the  log-likelihood  function  with  respect  to  er2, 

3 In L(t)  MQN  1 **  x , s 

= Zj—  + ^En  (m)n(m)  (A.17) 


da 2 


aH 


m= 1 


Computing  dlnL(r)/d7 

Straightforward  differentiation  with  respect  to  7*.  yields 

= ~~2  £ ^ (r(m)  “ ABz(m))*  (r(m)  - ABz(m)) 
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i M 
m=  1 


f \ * 

— A^— z(m)J  n(m)  + n*(ra) 


Af^z(m> 


^S2Re{(ASz(ra)) n(m)} 

i M f f)Ti*  ) 

— Y 2 Re  < z*(m)— — A*n(rn)  > 


m=l 


where 


air 

57^ 


(*»i) 


dlk 

' exp (~jdk)  if  i = j = 2k  - 1 
exp(-j0fc)  if  i = j = 2k 
0 otherwise 


Now, 


<9B* 

z*(m)- — A*n(m)  = exp(-j9k)  [z2k-i(m)  z2k(m )] 
oik 


and  the  gradient  with  respect  to  7 can  be  expressed  as 


x2k—\ 


x2k  J 


n(m) 


91„L(r)=J_  “ 2Re 


£>7 


m=l 


exp(-j^)  [zi(m)  z2(m)] 


.exp(-jOK)[z2K-i{m)  z2K(m )] 


n(m) 


1 M 

— Y, : 2Re{z*(m)B;A*n(m)} 


X2K- 1 


X2K  J 


n(m) 


m=l 


where  B7  E (p2ft'x2ft'  js  defined  as 


B7  = diag(exp  (j  9\ ) , exp  (j  6\ ) , exp(jdK),exp(jdK)) 


and  Z(m)  E { — 1,0,  l}2KxK  is  defined  as 

' z2k-i(m)  \ii  = 2k-\ 
(Z(m))(i,k)  = < z2k(m)  if  * = 2k 
0 otherwise 


(A.18) 


(A.19) 


(A. 20) 


(A.21) 


(A. 22) 


(A. 23) 
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Computing  d\nL(r)/dd 

The  derivation  above  for  the  gradient  with  respect  to  7 works  here  as  well,  if  we 
make  the  substitution  dB/dyk  <-  dB/dOk.  The  form  of  <91nL(r)/d0  will  be  similar 
to  the  form  of  <91n  L(r)/<97, 

<91nL(r)  1 ™ 

- de  = ~2  £ 2Re{Z*(m)B;A*n(m)}  (A.24) 

u m— 1 

where  Be  e (£2Kx2K  [s  defined  as 


B9  = diagtiPujPujfoJfa,  ■ ■ -JPkJPk)  = j B (A.25) 


Computing  d\nL(r)/dr 

Straightforward  differentiation  yields 


dlnL(r) 

drk 


= £ -n(m)*~Bz(m)  - z*(m)B*^n(m) 

a m=  1 OTk  OTk 

1 ^ f 7 * * \ 

= -5E2RT'WB'lr”H 

° m=  1 l °Tk  J 


Recall  the  expression  for  the  columns  of  A, 


(A. 26) 


a2k-l  = 


a2  k — 


£D(pk  + l,l)+(l-£)D(pk,l) 


c k 


'S, 

T, 


5k 


-D(p*  + l,-l)+h  -^lD(pfc>-l) 


ck 


(A.27) 
(A. 28) 


As  seen  from  these  equations  the  derivative  of  A with  respect  to  Tk , or  equivalently 
with  respect  to  5k,  might  not  exist  when  5k  = 0.  The  reason  for  this  is  that  a.2k  resides 
on  a straight  line  between  D(p*  + 1,  -l)c*  and  D (pk, -l)ck  when  rk  € \pkTu  (pk  + l)Tj 
and  between  D (pk  + 2,  — l)c*  and  D(pfc  + 1,  -l)cfc  when  rk  E [(p*  + 1)7;,  (pk  + 2)7-). 
Now  if  D(p*;,  — l)c*;,  D (pk  + 1,—  l)c;t,  and  D (pk  + 2,-l)cfc  are  not  collinear,  then 
the  derivative  will  not  exist  for  rk  = (pk  + 1)7).  The  Cramer-Rao  bound  is  not 
valid  if  the  derivative  does  not  exist  (see,  for  instance,  Van  Trees  [55]).  In  order  to 
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proceed,  we  therefore  make  the  assumption  that  6k  ± 0 for  k = 1, 2, . . . , K.  This  is 
a rather  mild  restriction,  since  the  assumption  only  implies  that  we  are  not  sampling 
in  synchronization  with  any  user.  It  now  follows  immediately  that 


dA 

drk 


[0 


0 


<9a 


■2k -1 


da 


■2k 


dn 


drk 


0 


0] 


Now  if  we  define 


then 


V>2  k = 
. dA 


da 


[2k-l 


drk 

da2k 


- ^r[D(lA  + !,!)-  D (pk,  l)]cfc 


1 


= Pk[z2k~i(m)  z2k{m )] 


and  the  gradient  with  respect  to  r can  be  expressed  as 

Pi[zi(m)  z2(m)\ 

<91nL(r) 


dr 


1 M 

<TZ  , 
m=  1 


r2k- ! 

* 

2k 

w 

l>2. 


n(m) 


n(m) 


Pk  [z2K-\{jn)  z2K(m )] 


j M 

= -2  E 2 Re {Z*(m)B*T,*n(m)} 


Xl}2K-\ 
Xi)2  K 


n(m) 


crz  — ' 

m=  1 


where  \1>  e jp^Q^x2/c  ig  dgfineci  as 


(A. 29) 


(A. 30) 
(A.31) 

(A. 32) 


(A. 33) 


* = [V>i  ^2  •••  ^2K  ] (A. 34) 

A. 1.2  Forming  the  Fisher  Information  Matrix 

We  have  now  computed  all  the  gradients  needed  to  form  the  Fisher  information 
matrix.  To  continue,  we  will  need  a few  results  (see  [46]  for  proofs).  Suppose  n(m) 
is  defined  as  in  (2.6). 


Result  4 


E[n*(m)n(m)iT(A;)n(A;)] 


( QN)2o 4 if  m ^ k 

QN(QN  + l)cr4  if  m = k 


(A. 35) 
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Result  5 For  all  m,  k, 

E[n*  (m)n(m)nT  (k)]  = 0 (A. 36) 


Result  6 Suppose  x,  y e (CQN,  then 

Re{x}  Re{yT}  = ^ [Re{xyT}  + Re{xy*}]  (A.37) 

Im{x}  Im{yT}  = ~ [Re{xyT}  - Re{xy*}]  (A.38) 

Re{x}  Im{yT}  = ^ [lm{xyT}  - Im{xy*}]  (A.39) 


The  (1,1)  block  of  the  Fisher  information  matrix  is 


E 


731nL(r)\2' 

= E 

[\  da2  ) j 

(MQN)2  2 MQN  ^ „ w N 
— L n (rn)n(m) 


a ' 


a 


m=  1 


M M 


+ ~s  Y X;n*(m)n(m)n*(A:)n(fc) 

° m=lfc=l 

(. MQN )2  2MQN 


CP 


- MQN  a 2 


+ 


MQN 


^ [MQN(QN  + l)cr4  + (M2  - M)(QN)2a‘ 


(A. 40) 


We  note,  by  using  Result  5,  that  d\n  L(r) /dcr2  is  uncorrelated  with  all  other 
gradients.  Thus,  the  (1,2),  (1,3),  and  (1,4)  blocks  of  the  information  matrix  will  be 
zero. 

To  find  the  remaining  blocks  of  the  information  matrix,  we  need  to  compute  the 
expected  values  of  all  possible  outer  products  of  the  gradients  (A. 21),  (A. 24),  and 
(A. 33).  The  amount  of  algebra  required  can  be  reduced  by  noting  that  the  gradients 
are  all  of  the  form 

2 M 

~ Y Re{F*(m)n(m)} 

(T* 

u m=  1 


(A.41) 
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where  F(m)  is  some  QN  x K complex  matrix.  The  general  expression  for  a block  i 
the  information  matrix  is  then 


in 


E 


M 


M 


2 1V1  ( 2 

Zz  £ Re  {Fi(m)n(m)}  ( — £ Re  {F*2(k)n(k)} 


T-, 


m=  1 


k= 1 


^ M M y 

= T4  £ £ o E[Re{Fl(m)n(m)(F2Wn(^))T} 

u m=l  fc=l  z J 

+ Re{F*(m)n(m)(F;(A:)n(A:))*}] 

2 M M 

= ^4  XI  £ {FKm)((j2I QN&K{k  - m))F2(fc)] 


m= 1 fc=l 


2 ^ 

= 52  T Re{F;(m)F2(m)} 


m=l 


(A. 42) 


The  blocks  of  J,  as  found  in  (4.5)  (4.10),  now  follows  by  identifying  the  corresponding 
F(m)  in  (A. 21),  (A. 24),  and  (A. 33),  and  substituting  the  result  into  (A. 42). 

It  appears  to  be  difficult  to  simplify  the  expression  for  the  information  matrix  any 
further.  However,  as  the  observation  period  becomes  large,  i.e.,  as  M — > oo,  some 
simplifications  can  be  made.  The  blocks  of  the  information  matrix  in  (4.4)  are  of  the 
form 

Re  { Z*(ra)HZ*(ra)  1 (A.43) 

lm=l  J 

where  H is  some  2 K x 2 K complex  matrix.  This  can  be  exploited  as  follows.  The 
(b  j)th  element  of  Z*(ra)HZ(m)  is  z*Hz j where  zk  is  the  kth  column  of  Z(m), 


z*;  = [0  •••  0 z2k-\{m)  z2k(m)  0 •••  0] 


Thus, 


z*HZj  = [Z2i-i(m)  z2i(m)  ] H. 


v 


z2j-i{m ) 

z2j(m)  _ 


(A. 44) 


(A. 45) 


where  H„  6 (C2x2  is  defined  as 


H ij  = 


H(2*  — 1, 2j  — 1)  H(2*  — 1, 2j) 
H(2z,  2j  — 1)  H(2i,  2j) 


(A. 46) 
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Expanding  (A. 45)  yields 

z’Hz  j = Z2i-i(m)z2j-i(m)llij(l,  1)  + z2i{m)z2j-i(m  )Hy(2, 1) 

+ z2i-i(m)z2j(rn)liij(l,  2)  + z2i(m)z2j(m)llij(2,  2)  (A. 47) 

As  M — * oo, 

1 M 

aJ™,  S *p(mH(m)  = EfeH^(m)]  (A. 48) 

m=l 

and,  since  di(m)  and  dj(m ) are  zero-mean  and  independent  if  i ^ j and  since  di(m) 
and  di(m  — 1)  are  independent, 

E[z2i_i(m)z2j-i(m)]  = -SK(i  - j)  (A. 49) 

E [z2i(m)z2j(m)]  = -SK(i  - j ) (A.50) 

E[z2t-i(rn).2:2j(m)]  = 0 (A. 51) 

Now, 

Ai  M 

Ita  E <Hz,  = T(Hy(l,  1)  + H„(2, 2 ))SK(i  - j)  (A. 52) 

m=l  " 

It  is  not  hard  to  see  that 

Ai  M 

^lim^  Z*(m)HZ(m)  = — G*  diag(H)G  (A.53) 

m=l  z 

where  G is  defined  as  in  (4.20).  In  other  words,  (A.53)  is  a diagonal  K x K matrix 
whose  A;th  diagonal  element  is  (H(2&;  - 1,2/c  — 1)  + H(2k,2k))M/2.  Note  that  if 
X,  B G (&2Kx2K  and  if  B is  diagonal  such  that  B(2A;  — 1, 2k  — 1)  = B(2A;,  2k),  then 

G*  diag(B*XB)G  = G’  diag(X)G-G*B*BG  (A.54) 

2 

This  is  exactly  the  form  of  the  blocks  of  J.  For  instance,  consider 

2 M 

J«y«y  oo  = lim  J-y-v  = lim  — Y'  Re  (Z*(m)BIA* AB7Z(m)| 

7 7’  M-400  7T  M— >oo  a2  , 1 v 7 7 7 V 2 j 

m=  1 
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= ~ Re  {G*  diag(B*  A*AB7)G} 

= ^ Re  {g*  diag(A*A)G^G*B*B7G  j 
M 

= — Re  {G*  diag(A*A)G}  (A.55) 


since  B*B7  = I 2k  and  G*I2;rG  = 2\K.  Furthermore,  B*B0  = jr | B | , B*B  = |B|, 
BgB0  = |B|2,  B^B  = — j | B | 2 , and  B*B  = |B|2,  where  we  by  |B|  denote  the  matrix 
whose  elements  are  |B|(z,j)  = |B(i,j)|.  The  blocks  of  J as  M ->■  oo  can  now  be 
written  as 


•^70,00  — hm  JyO  — 0 
M->  00 

M 1 

J7TOO  = AlimoJ7r  = ^G*diag(A*¥)G-G*|B|G 
= ^G*  diag(A*\F)G\/P 
J00iOO  = Urn  Jee  = ^G*diag(A*A)G^G*|B|2G 

M— >00  1 


M 

= — G*diag(A*A)GP 


^Ot, 00  — lim  J qt  — 0 

M—too 

JTTt00  = ilim_  JTT  = ^G*diag(**tf)G^G*|B|2G 

(j  & 


M—t  00 


M 

= — G*diag(^*^)GP 


(A. 56) 


(A.57) 


(A. 58) 
(A. 59) 


(A. 60) 


where  the  power  matrix  P G IRKxK  is  defined  by  (4.21).  The  expression  (4.15)  for 
asymptotic  information  matrix  now  follows. 


A. 2 CRB  for  Fading  Channels 

We  will  now  repeat  the  CRB  derivation  for  the  fading  channel  model  as  described 


in  Chapter  6. 


122 


A. 2.1  Differentiating  the  Log-Likelihood  Function 

It  is  easy  to  show  that  the  log-likelihood  function  (conditioned  on  z)  of  r with 
respect  to  <f>  is 


1 M 

lnL(r)  = — MQN\na2  — — E ||r(m)  — As(m)|| 


m=  1 


(A.61) 


Computing  3 In  L(r)/da2 

We  start  by  differentiating  the  log-likelihood  function  with  respect  to  a2, 


31nL(r) 

da2 

Computing  3 In  L(r)//3(m) 
Consider 

31nL(r) 
3/3  (m) 


1 1 


M 


= -MQN—  + ^ E llnH 


m= 1 


1 3 


a 


' d/3(m) 


|r(m)  - As(m) 


With  the  above  Results  1,  2,  and  3 in  mind,  we  can  easily  show  that 


31nZ/(r)  1 

3/3  (m)  a2 


3s (m)  \ T / / 3s(m)  \ * 

' ' Arn(m)  + ( ) A*n(m) 


= —z2  Re 


3/3  (m) 

^ ds(m) 


v3/3(m) 


A*n(m) 


where 


3s(m)  _ 3s(m)  3s(m) 


3/3  (m) 
3s  (ra) 
3/3fc(m) 
3s  (m) 
33fc,r(m) 

Recall  that 


= [ 


d/31(m ) d{32(m) 

3s  (m)  3s  (m) 


3s  (m) 
3^^(m) 
3s(m) 


G C 


2RxR 


3/9fc,i(m)  3^,2  M 34, h*  M 

rQ  ...  0 3sfc,2r-i(m)  3gfc,2r(m) 

^2(r  — 1)  zeros'  9i(m) 


(A. 62) 


(A. 63) 


(A. 64) 


(A. 65) 
(A. 66) 


0lr  e (A.67) 


Sk,2r-i(m)  = /3fc,r(m)22fc_1(m)  (A.68) 

Sk,2r(m)  = 3fc,r(m)22fc(m)  (A. 69) 
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and  therefore 

dskt2r-i(m) 


d 


dp k,r(m)  dPk,r{m) 


[. Pk,r(rn ) + jPk,r(m)\z2k-i(m)  = z2k-i{m) 


and 


dskt2r{m ) 


= z2k{m) 


dpk,r{m) 

If  we  define  Z k[rn)  G { — 1, 0,  l}2^xH*  as 

’ Z2k-i{m)  if  i = 2j  - 1 

(z  k(m))(i,j)  = < 


then 


ds(m) 

dPk(m) 


z2k(m)  if  i = 2j 

0 otherwise 

0 

z k(m) 

0 


where  there  are  \ 2 Ri  leading  zero  rows  in  ds (m)/d(3k(m). 

We  can  now  easily  form  ds(m)/d/3(m)  as 

= diag(Z Pm),  Z2(m), . . . , Z K(m))  = Z(m) 

dpym) 

Computing  d\n  L{r) /d/3(m) 

Similarly  as  for  /3(m),  it  can  shown  that 


and 


Thus, 


<91nL(r)  = J_2  Re  17  ds(m) ' 
d{3(m)  cr2  \dp(m)i 


ds  (m)  ds(m) 
= ]' 


A*n(ra) 


<9/3  (m)  <9/3  (m) 


<9/3  (m ) cr2 


. ( <9s(m) ' 

dP{m)4 


A *n(m) 


= — Im  [Z*(m)A*n(m)] 


(A. 70) 
(A.71) 

(A. 72) 
(A. 73) 

(A. 74) 

(A. 75) 
(A.76) 


(A. 77) 
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Computing  d In  L(r) /dr 

We  will  now  turn  our  attention  to  the  derivative  of  lnL(r)  with  respect  to  r. 
Using  Results  2 and  3,  we  can  show  that 

d\nL(r)  1 T (d A \T  _ (d A \* 

= ~2  S s (m)  I ol  I n(m)  + s (rn)  ( | n(m) 


dr 


k,r 


m=  1 
M 


dr 


fc,r  , 


, 8^k,r 


m=l 


s*(m) 


' <9A 


n(m) 


Consider 


9A 

drk>r 

dAk 


= [0. 


dAk 

0 — — ~ 0 


k— 1 matrices 

[9___S 

2(i — 1)  columns 


drk. 

da 


0 


G JfU^Nx2# 


fc,2r— 1 


9a, 


k,2r 


drktr  W , ' 9rfc,r 

We  define  i/fkt2r-v '•Pk, 2r  G as 

9a^  2r-l 


dr. 


0 


0 


fc,r 


G ]R(5^Vx2fiA: 


drfc,r 


^k,2r  = 


dak; 


2r 


9ri 


fe,r 


It  is  easy  to  see  that 


s*(m) 


dA 


.dr, 


k,r  > 


[ Sfc,2r-1  Sfc,2r  . 


tfU-i 

rk,2r  J 


and  thus 


s*(m) 


'dA' 

drki 


= n(m)n 


where  we  have  defined  r^rn)  G <&2RkXRk  and  ^k  G IR<5'Vx2Rfc  as 

Sk,i(m)  if  i € {2j  - l,2j} 
0 otherwise 

*k  = [*Pk,  1 *Pk,  2 ••• 


(rfc(m))(z,j) 


Finally,  we  can  form 


91nL(r) 

dr 


2 M 

— ^ Re  [r*(m)\f,*n(m)] 


(A. 78) 


(A. 79) 
(A. 80) 


(A.81) 


(A. 82) 


(A. 83) 


(A. 84) 
(A. 85) 


(A. 86) 


m=l 
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where 


r(m)  = diag(r1(m),r2(m),...,IV(m))  G C2Hxfl 

^ = •••  tyK]  e IR^*2* 


(A. 88) 


(A. 87) 


Summary 


As  shown  above,  the  derivatives  of  the  log-likelihood  function  can  be  written  as 


A. 2. 2 Forming  the  Fisher  Information  Matrix 

W e form  the  blocks  of  J by  taking  the  expected  value  of  all  possible  outer  products 
of  the  matrices  (A.89),  (A.90),  (A.91),  and  (A.92). 

Computing  E[(dInL(r)/<9(T2)(-)rl 
The  (1, 1)  block  in  J is 


(A.92) 


(A.91) 


(A.90) 


(A.89) 


o' 


+ Tk  H £E[n*(m)n(m)n*(p)n(p)] 

u m=l  p=l 

( MQN )2  _ 2(MQNl 


o 


<7 


+ — [(M2  - M)(QN)2o 4 + MQN{QN  + l)cr4] 
MQN 


(A. 93) 
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By  using  Result  5,  it  can  be  shown  din  L/ do 2 is  uncorrelated  with  all  other  deriva- 
tives. Thus  the  (l,n)  blocks  (and,  therefore,  the  ( n , 1)  blocks)  of  J will  be  zero. 
Computing  E\(d  In  L(r)/d(3(m))(d  In  L(r) /d(3(p))T] 

Some  straightforward  calculations  using  Result  6 and  the  fact  that  E[n(m)n7  (p]  = 
0 yield 


31nZ/(r) 

fdlnL\T' 

dj3(m) 

l m ) . 

E |Re  [Z*(ra)A*n(m)]  Rer  [Z*(p)A*n(p)]j 


= ~4^  Re  [Z*(m)A*  E(n(m)n*(p)}AZ(p)] 

= — Re[Z*(m)A*AZ(p)]8K(m  — p)  (A. 94) 


Computing  E\(dln  L(r) /d(3(m))(dln  L(r) /d/3(p))T] 

Similar  to  above, 

E din L(r)  f <91nL\ 

_ dp{m)  v Pip)  J 

= Im  [Z*(m)A*  E{n(m)n*(p)}AZ(p)] 

= — ^ Im[Z*(m)A*AZ(p)]5/<-(ra  — p ) (A. 95) 


= — E |Re  [Z*(m)A*n(m)]  ImT  [Z*(p)A*n(p)]j 


Computing  E[(dln  L(r) /dj3(m))(d\n  L(r) /d(3(p))r] 
Again,  as  above, 


dlnL(r) 

fdlnLV' 

d(3(m) 

V P{p)  J 

~ E jlm  [Z*(m)A*n(m)]  Im7  [Z*(p)A*n(p)]| 
Re  [Z*(m)A*  E{n(m)n*(p)}AZ(p)] 


= — Re[Z*(m)A*AZ(p)](5/c(m  - p) 
oz 


d In  L(r) 

( din  L(t)\T 

<9/3  (m ) 

\ Pip)  J . 

(A. 96) 
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Computing  E\(d\nL(r)/d0{m))(d\nL(T)/dr)T 1 
Straightforward  computations  show 


dlnL(r)  / d\nV 
<9/3  (m)  \ dr  / 


= — E ^ Re  [Z*(m)A*n(m)]  Re1 


M 


El"W*‘n(p) 


P=  1 


4 1 M 

= ^2^Re|Z’(m)A'E{n(m)n'(j,)lOTWl 


P=  1 


= ^Re[Z*(m)A**r(m)] 


(A. 97) 


Computing  E\(dln  L(r) /d0(m))(d\n  L(r) /dr)1 
Straightforward  computations  show 


E 


<91nL(r)  / <91nL\ 
3/3  (ra)  \ <9r  ) 


E 


31nL(r)  / 5 In  L 

dr  Vd/3(ra)y 


T 


= — r E < Re 


£7 


M 


Y r*(p)#*n(p) 

P= 1 


Imr  [Z*(m)A*n(m)] 


4 1 M 

= -^5EImT(r,W*-E{n(m)n-(p)}AZ(m)] 

^ P=  1 


= — Im[Z*(m)A**r(m)l 

<7Z 


(A. 98) 


Computing  E[(glnL(r)/gr)(glnL(r)/gr)T1 


The  last  block  of  the  information  matrix  is 


<9 In L(t)  / <9 In L' 
<9r  1 dr 


-E^Re 


M 


Y r*(m)^*n(m) 


.772=1 


Re' 


M 


Er*(^‘n(p) 


p=i 


4 i M M 

= ~\2  Y HRe[r*(m)^*E{n(m)n*(p)}^r(p)] 


772=1  p=  1 


2 ^ 

= -j  E Re[r*(m)^*vhr(m)] 


m=  1 


(A. 99) 
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Assembling  the  Fisher  information  matrix 

We  are  now  ready  to  form  J.  The  information  matrix  is  partitioned 


as 


J = 


J<7(7  J(TS  J <7T 

TT  T T 

° as  USS  Jsr 

Jr  Tr  T 

° OT  dST  J TT  _ 


(A. 100) 


where 


J IT  IT  E 


J(TS  E 


■1  n-r  — E 


J TT  = E 


'dlnL(r)  /ainL(r)\T 

<9<r2  y da2  J 

dlnL(r)  ( <91nL(r)\T 
do2  \ dp  J 

ainL(r)  /<91nZ,(r)\r 

da 2 [dr  ) 

dlnL  f din L\T 


MQN 


0 


= 0 


dr  \ dr  J 
and  where  Jss  and  JST  are  defined  as 

Jss  = E 


2 m 

= — £ Re[r*(m)T'*T'r(m)] 

° m=l 


(A. 101) 
(A. 102) 
(A. 103) 
(A. 104) 


J st  — E 


'dlnL(r) 

fainL(r)\Tl 

d(3 

l dp) 

dlnL(r) 

( <91nL(r)\r 

dp 

V dr  J 

(A. 105) 
(A. 106) 


From  (A. 94),  (A. 95),  and  (A. 96),  we  note  that  the  derivative  of  lnT(r)  with  respect 
to  P(m)  and  (3(p ) are  uncorrelated  when  m ^ p.  Jss  will  therefore  be  block  diagonal 


Jss  = diag(Jss(l),  Jss(2), . . . , JSS(M)) 


where  the  mth  block  is 

2 [ Re[Z*(m)A*AZ(m)]  - Im[Z*(m)A*AZ(m)] 
Jss  (m)  = — 

[ — Imr[Z*(m)  A*AZ(m)]  Re[Z*(m)A*AZ(m)] 
Furthermore,  from  (A. 97)  and  (A. 98)  we  see  that 

Jsr  = [JlT(l)  JfT(2)  •••  J UM)]T 


(A. 107) 


(A. 108) 


(A. 109) 
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where 


JS7-(77!’)  — 


Re[Z*(m)A*«T(m)] 

.Im[Z*(ro)A**T(m)]. 


In  summary,  we  can  write  the  information  matrix  as 


J = 


0 

0 


0 0 


V T 

JST  JTT  J 


(A. 110) 


(A.111) 


A. 2. 3 Inverting  the  Information  Matrix 

The  CRB  matrix  is  found  as  the  inverse  of  the  Fisher  information  matrix.  We 
are  mostly  interested  in  bounding  the  performance  of  a propagation  delay  estimator. 
The  bound  on  the  covariance  matrix  of  such  an  estimator  is  found  as  the  lower  right 
hand  side  R x R block  of  the  CRB  matrix.  In  other  words,  we  can  write  the  CRB 
matrix  as 


J-1^ 


CRB(a2)  0 0 

0 CRB(s)  x 
0 x CRB(r) 


(A. 112) 


where  x denotes  matrices  that  are  of  no  particular  interest  to  us  and  where 


CRB(r)  < E[(f  — r)(r  — r)r] 


(A. 113) 


for  any  unbiased  estimator  f of  the  propagation  delay  vector  r. 

Invoking  a standard  result  on  the  inverse  of  a block  matrix  (see  [40],  Lemma  1, 
page  53),  we  see  from  (A. Ill)  that 


CRB-1(t)  = JTT  - 


(A. 114) 


To  compute  the  last  term  of  this  equation,  we  first  make  a few  definitions  for  nota- 
tional  convenience.  Let 


H(m)  = -Z’(m)A’AZ(m) 


o' 


(A. 115) 
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G(m)  = H~1(m) 
A(m)  = 


Note,  since  H(m)  is  hermitian,  that  Im[H(m)]  must  be  skew  symmetric, 


ImT[H(m)]  = — Im[H(m)] 


With  this  in  mind,  we  can  write  Jf 

,a(m)  as 

'H(m) 

— H(m) 

Jss(m) 

= 

_H(m) 

H(m) 

It  is  easy  to  show  that 

G(m) 

— G(m) 

= 

G(m) 

G(m ) 

Now, 

'JssHl) 

0 

0 

Tr  T-1T  — Tr 

JsrJss  “st  — JST 

0 

Jsl1 

(2) 

0 

0 

0 

Jss  \M) 

M 


m=  1 


M 


^[Ar(m)  AT(m)  ] 

m=\ 

M 

E [Ar(m)  A T{m)] 

71=1 

M 

E[AT(m)  Ar(m)  ] 


’G(ra)  -G(m)'  T A(m) 

_G(m)  G(m)  _ A(m)_ 

G(m)A(m)  — G(m)A  (m) 

_G(m)A(m)  + G(m)A(m)_ 
Re[G(m)A(m)] 

_Im[G(m)A(m)]  _ 

= E Ar(m)Re[G(m)A(m)]  + Ar(m)  Im[G(m)  A(m)] 


m= 1 
M 


m=  1 
M 


E Re[A*(m)G(m)A(m)] 


m—  1 


(A. 116) 
(A. 117) 

(A. 118) 

(A. 119) 
(A. 120) 


(A. 121) 
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Finally,  we  can  find  the  desired  CRB  as 

M 

CRB  j(t)  = JTr  — ^2  Re[A*(m)G(m)A(m)] 

m=  1 
M 

= ERe 

m= 1 
2 ^ 

= s £ Re[r*(m)^’^r(m) 

17  m=l 


^r*(m)'F*^r(m)  - A*(m)G(m)A(m) 


— r*(m)^*AZ(m)(Z*(m)A*AZ(m))-1Z*(m)A*^r(m) 

(A. 122) 

where  Piz(m)  is  the  projection  matrix  onto  [range(AZ(m))]J- 


2 M 

= E Rdr'(m)*’PizWW(m) 


m=  1 


PAZ(m)  = Iqa  - AZ(m)(Z*(m)A*AZ(m))  (A. 123) 
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This  dissertation  was  typed  and  edited  in  Emacs  version  19.25  using  AUC  TeX 
mode  version  9.  li  and  was  typeset  by  M^X2e  version  <1994/06/01>  patch  level  4. 
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will  not  accept  any  role  as  maintainer  of  this  software.  Moreover,  he  will  not  be  able 
to  answer  any  questions  regarding  its  use. 
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